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(57) Abstract: The present invention relates to the high bone mass {HBKd) gene, the coiresponding wild-type gene {Zmax2\ and 
mutants thereof. The genes identified in the present invention are implicated in regulation of physiological lipid levels, and thereby 
lipid-mediated diseases and conditions. The invention also provides nucleic adds, including coding sequences, oligonucleotide 
primers and probes, proteins, cloning vectors, expression vectors, transformed hosts, methods of developing pharmaceutical com- 
positions, methods of identifying molecules involved in lipid level regulation in a subject. In preferred embodiments, the present 
invention is directed to methods for treating and preventing atherosclerosis, arteriosclerosis cardiovascular disease, atherosclerotic 
and arteriosclerotic associated conditions. 
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REGULATING LIPID LEVELS 
VIA THE ZMAXl OR HBM GENE 

TNVKNTQRS : John P. Carulli, Randall D. Little, Robert R. Recker and Mark L. Johnson 

PVT ATKP APPLICATIONS 

This application is a continuation-in-part of Application No. 09/543,771 filed April 5, 
2000 and Application No. 09/544;398 filed April 5, 2000, which are continuation-in-part 
5 applications of Application No. 09/229,319, filed January 13, 1999, wHch claims benefit of 
U.S. Provisional AppUcationNo. 60/071,449, filed January 13, 1998, and U.S. Provisional 
Application No. 60/105,511, filed October 23, 1998. aU of which are herein mcofporated by 
reference in their entirety. 

FTELD OF THF. INVENTION 

10 The present invention relates generally to the field of genetics, genomics and 

molecular biology. More particularly, the invention relates to methods and materials used to 
isolate, detect and sequence a high bone mass gene and corresponding wild-type gene, and 
mutants thereof fliat may be involved with modulating lipid levels. The present invention 
also relates to the high bone mass gene, the corresponding wild-type gene, aad mutants 

15 thereof. The genes identified in the present invention are impUcated in the ontology and 
physiology of atherosclerosis, arteriosclerosis and associated diseases and conditions related 
thereto. The invention also provides nucleic acids, proteins, cloning vectors, expression 
vectors, transformed hosts, methods of developing pharmaceutical compositions, methods of • 
identifying molecules involved in arteriosclerosis and associated conditions, and methods of 

20 treating or preventing diseases associated with abnormal lipid levels. In preferred 
embodiments, the present invention is directed to methods for treating, diagnosing. 
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preventing and screening for normal and abnormal Hpid-associated conditions, including 
arteriosclerosis, cardiovascular disease and stroke. 

BACKGR OUND OF THE TIWFNTION 

5 Cardiovascular disease is the number one kiUer in the United States, and 

atherosclerosis is the major cause of heart disease and stroke. It is widely appreciated that 
cholesterol plays an important role in atherogenesis. Normally, most cholesterol serves as a 
. structural element in the walls of cells, whereas much of the rest is in transit through the 
blood or functions as the starting material for the syndesis of bile acids in the Kver, steroid 
10 hormones in endocrine ceUs and vitamin D in skin. The transport of cholesterol and other 
lipids through the circulatory system is facilitated by their packaging into Upoprotein carriers. 
These spherical particles comprise protein and phosphoUpid shells surrounding a core of 
neutral Hpid, including unesterified ("free") or esteiified cholesterol and triglycerides. Risk 
for atherosclerosis increases with increasing concentrations of low density Hpoprotein (LDL) 
15 cholesterol, whereas risk is inversely proportional to the levels of high density lipoprotein 
(HDL) cholesterol. The receptor-mediated control of plasma LDL levels has been well- 
defined, and recent studies have now provided new insighte .into HDL metabolisnl 

The elucidation of LDL metabolism began in 1974 by Michael Brown and Joseph 
Goldstein. La brie^ the liver synthesizes a precursor Hpoprotein (very low density 
20 Upoprotein, VLDL) that is converted during circulation to intermediate density Upoprotein 
(IDL) and then to LDL, The majority of the LDL receptors expressed in the body are on the 
surfaces of Uver ceHs, although virtually aU other tissues ("peripheral tissues") express some 
LDL receptors. After binding, the receptor-Upoprotein complex is internalized by the cells 
via coated pits and vesicles, and the entire LDL particle is deUvered to lysosomes, wherein it 
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is disassembled by enzymatic hydrolysis, releasing cholesterol for subsequent cellular 
metabolism. This whole-particle uptake pathway is called "receptor-mediated endocytosis." 
Cholesterol-mediated feedback r&gulation of both the levels of LDL receptors and cellular 
cholesterol biosynthesis help ensure cellular cholesterol homeostasis. Genetic defects in the 

5 LDL receptor in humans results in famihal hypercholesterolemia, a disease characterized by 
elevated plasma LDL cholesterol and premature atherosclerosis and heart attacks. One 
hypothesis for the deleterious eflfects of excess plasma LDL cholesterol is that LDL enters the 
artery wall, is chemically modified, and then is recognized by a special class of receptors 
called macrophage scavenger receptors, that mediate the cellular accumulation of the LDL 

10 cholesterol in the artery, eventually leading to the formation of an athorosclerotic lesion. 

The major Upoprotein classes include intestinally derived chylomicrons that transport 
dietary fets and cholesterol, hepatic-derived VLDL, IDL and LDL that can be atiierogenic, 
and hepatic- and intestinally-derived HDL tiiat are antia&erogenic. Apoprotein B (ApoB) is 
necessary for the secretion of chylomicrons (Apo B48) and VLDL, IDL and LDL (Apo 

15 BlOO). Plasma levels of VLDL triglycerides are determined mainly by rates of secretion in 
LPL hpolytic activity. Plasma levels of LDL cholesterol are determined mainly by the 
secretion of Apo BlOO into plasma, the efficacy with which VI^DL are converted to LDL and 
by LDL receptor-mediated clearance. Regulation of HDL cholesterol levels is complex and 
is affected by rates of synthesis of its Apo proteins, rates of esterfication of free cholesterol to 

20 cholesterol ester by LCAT, levels of triglyceride-rich hpoproteins and CETP-mediated 
transfer of cholesterol esters from HDL, and clearance fi^m plasma of HDL Upids and Apo 
protons. 

Normal lipoprotein transport is associated with low levels of triglycerides and LDL 
cholesterol and high levels of HDL cholesterol. "When Hpoprotein transport is abnormal, 
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lipoprotein levels can change in ways that predispose individuals to atherosclerosis and 
arteriosclerosis (see Giasberg, Endocrinol Meiab. Clin. North Am,, 27: 503-19 (1998)). 

Several lipoprotein receptors may be involved in cellxilar lipid uptake. These 
receptors include: scavenger receptors; LDL receptor-related protein/al-macroglobulia 
5 receptor (LRP); LDL receptor; and VLDL receptor. With the exception of the LDL receptor, 
all of these receptors are expressed in atherosclerotic lesions while scavenger receptors are 
mostly expressed in macrophages, the LRP and VLDL receptors may play an important role 
in mediating lipid uptake in smooth muscle cells (Hiltunen et al. Atherosclerosis^ 137 suppL: 
S8I-8 (1998)). 

10 A major breakdirough in the pharmacologic treatment of hypercholesterolemia has 

been the development of the "statin" class of 3-hydroxy-3-methylglutaryl-CoA reductase 
(HMG CoA reductase) inhibitory dmgs. 3-Hydroxy-3-methylglutaryl-CoA reductase is the 
rate controlling enzyme in cholesterol biosynthesis, and its inhibition in the liver stimulates 
LDL receptor expression. As a consequence, both plasma LDL cholesterol levels and the risk 

15 for atherosclerosis decrease. The discovery and analysis of the LDL receptor system has had 
a profound impact on cell biology, physiology, and medicine. 

HDL is thought to remove xmesterified, or "free" cholesterol (FC) from peripheral 
tissues, after which most of the cholesterol is converted to cholesteryl ester (CE) by enzymes 
in the plasma. Subsequently, HDL cholesterol is efficiently dehvered directly to the hver and 
20 steroidogenic tissues via a selective uptake pathway and the BDDL receptor, SR-BI (class B 
type I scavenger receptor) or, in some species, transferred to other lypoproteins for additional 
transport in metabolism. For additional discussion on HDL and LDL metaboUsm see 
Krieger, Proc. Natl Acad. Set USA, 95:4077-4080, 1998. 
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Recently, a strong interest in the genetic control of peak bone mass has developed in 
the field of osteoporosis. The interest has focused mainly on candidate genes with suitable 
polymoiphisms to test for association with variation in bone mass within the normal range, or 
has focused on examination of genes and gene loci associated with low bone mass in the 
5 range found in patients with osteoporosis. The vitamin D receptor locus (VDR) (Morrison 
al. Nature, Z^ia%A-2%l (1994)), PTH gene (Howard et al., J. Clin, Endocrinol Metab., 
80:2800-2805 (1995); Johnson et al, J. Bone Miner. Res., 8:1 1-17 (1995); Gong et al, J. 
Bone Miner. Res., 10:S462 (1995)) and the estrogen receptor gene (Hosoi et al, J. Bone 
Miner. Res., 10:S170 (1995); Morrison et al. Nature, 367:284-287 (1994)) have figured most 
10 prominently in this work. These studies are difficult because bone mass (the phenotype) is a 
continuous, quantitative, polygenic trait, and is confounded by enviromnental factors such as 
nutrition, co-morbid disease, age, physical activity, and other factors. Also, this type of study 
design requixeslarge numbers of subjects. In particular, the results of VDRstudies to date 
have been confusing and contradictory (Gamero et al, J. Bone Miner. Res., 10:1283-1288 
15 (1995); Eisman et al., J. Bone. Mvier. Res., 10:1289-1293 (1995); Peacock, J. Bone Miner. 
Res., 10:1294-1297 (1995)). Furthermore, the work thus far has not shed much Hght on the 
mechamsm(s) whereby the genetic influences might exert ^?ir effect on bone mass. 

While it is well known that peak bone mass is largely determined by genetic rather 
than environmental factors, studies to detemiine the gene loci (and ultimately the genes) 
20 linked to variation in bone mass are difficult and expensive. Study designs which utilize the 
power of linkage analysis, e.g., siVpair or extended family, are generally more informative 
than simple association studies, although the latter do have valne. However, genetic linkage 
studies involving bone mass are hampered by two major problems. The first problem is the 
phenotype, as discussed briefly above. Bone mass is a continuous, quantitative trait, and 
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establishing a discrete phenotype is difficult. Each anatomical site for measurement may be 
influenced by several genes, many of which may be different from site to site. Tbe second 
problem is the age component of the phenotype. By the time an individual can be identified 
as having low bone mass, there is a high probability that their parents or other members of 
5 prior generations wiU be deceased and therefore unavailable for study, and younger 

generations may not have even reached peak bone mass, making their phenotyping uncertain 
for genetic analysis. 

Regardless, linkage analysis can be used to find the location.of a gene causing a 
hereditary "disorder" and does not require any knowledge of the biochemical nature of the 
10 disorder, i.e., a mutated protein that is beheved to cause the disorder does not need to be 
known. Traditional approaches depend on assumptions concerning the disease process that 
might implicate a known protein as a candidate to be evaluated. The genetic localization 
approach using linkage analysis can be used to first find the general chromosomal region in 
which the defective gene is located and then to gradually reduce the size of the region in 
15 order to determine the location of the specific mutated gene as precisely as possible. After 
the geue itself is discovered within the candidate region, the messenger KNA and the protein 
are identified and, along with the DNA, are checked for mutations. 

The genetic localization approach has practical impHcations since the location of the 
disease can be used for prenatal diagnosis even before the altered gene that causes the disease 
20 is found. Linkage analysis can enable families, even many of those that do not have a sick 
child, to know whether they are carriers of a disease gene and to evaluate the condition of an 
unborn child through molecular diagnosis. The transmission of a disease within famihes, 
then, can be used to find the defective gene. As used herein, reference to "high bone mass" 
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(HBM) is analogous to reference to a disease state, although from a practical standpoint high 
bone mass can actually help a subject avoid the disease known as osteoporosis. 

Linkage analysis is possible because of the nature of inheritance of chromosomes 
from parents to offspring. During meiosis, the two parental homologues pair to guide their 
5 proper separation to daughter cells. While they are lined up and paired, the two homologues 
exchange pieces of the chromosomes, in an event called "crossing over" or "recombination." 
•nie resulting chromosomes are chimeric, that is. they contain parts that origmate from both 
parental homologues. Hie closer together two sequences are on the chromosome, the less 
likely that a recombination event wiU occur between them, and the more closely linked they 
10 are. In a linkage analysis experiment, two positions on the chromosomes are followed from 
one generation to the next to determine the frequency of recombination between them In a 
study of an inherited disease, one of the chromosomal positions is marked by the disease gene 
or its normal counterpart, i.e., the inheritance of the chromosomal region can be detennined 
by examining whether the individual displays symptoms of the disorder or not The other 
15 position is marked by a DNA sequence that shows natural variation in the population such 
that the two homologues can be distinguished based on the copy of the "marker" sequence 
that they possess. In every faimly, the inheritance of the g^ietic maricer sequence is 
compared to the inheritance of the disease state. If, within a family carrying an autosomal 
dominant disorder such as high bone mass, every affected individual carries the same form of 
20 the marker and all the unaffected individuals carry at least one different form of the marker, 
there is a great probability that the disease gene and the marker are located close to each 
other. In this way. chromosomes maybe systematically checked with known markers and 
compared to the disease state. IHe data obtained from the different famiUes is combined, and 
analyzed together by a computer using statistical^ethods. The result is information 
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indicating theprobabiUty of linkage between the genetic marker and the disease allowing 
different distances between them. A positive result can mean that the disease is very close to 
the marker, while a negative result indicates that it is far away on that chromosome, or on an 
entirely different chromosome. 
5 Linkage analysis is perfoimed by typing all members of the affected family at a given 

marker locus and evaluating the co-inheritance of a particular disease state with the marker 
probe, thereby determining how often the two of them are co-inherited. The recombination 
frequency can be used as a measure of the genetic distance between two gene loci. A 
recombination frequency of 1 % is equivalent to 1 map unit, of 1 centiMorgan (cM), which is 
10 roughly equivalent to 1,000 kb of DNA This relationship holds up to frequencies of about " 
20% or20cM. 

The entire human genome is 3,300 cM long. In order to find an unknown disease 
gene within 5-10 cM of a marker locus, the whole human genome can be searched with 
roughly 330 infoimative marker loci spaced at approximately 10 cM intervals (Botstein et al, 

15 Am. J. Hum. Genet, 32:314-331 (1980)). The reliability of linkage results is established by 
using a number of statistical.mefhods. The method most commonly used for the analysis of 
linkage in humans is the LOD score method (Morton, Prog, Clin. Biol. Res., 147:245-265 
(1984), Morton al.. Am. J. Hum. Genet., 38:868-883 (1986)) which was incorporated into 
the computer program LIPED by Ott, ^ Hum. Genet., 28:528-529 (1976). LOD scores 

20 are the logarithm of the ratio of the Hkehhood that two loci are linked at a given distance to 
that fliey are not linked (>50 cM ^ait). The advantage of using logarithmic values is that 
they can be summed among femiiies with the same disease. This becomes necessary given 
the relatively small size of human families. 
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By convention, a total LOD score greater than + 3.0 (that is, odds of linkage at the 
specified recombination frequency being 1000 times greater than odds of no linkage) is 
considered to be significant evidence for linkage at that particular recombmation frequency. 
A total LOD score of less than - 2.0 (that is, odds of no linkage being 100 times greater than 
5 .. odds of linkage at the specified frequency) is considered to be strong evidence that the two 
loci imder consideration are not linked at that particular recombination frequency. UntU 
recently, most linkage analyses have been performed on the basis of two-point data, which is 
the relationship between the disorder under consideration and a particular genetic marker. 
However, as a result of the rapid advances in mappmg the human genome over the last few 
10 years, and concomitant improvements in computer methodology, it has become feasible to 
carry out linkage analyses using multi-point data. Multi-point analysis provide a 
simultaneous analysis of linkage between the disease and several linked genetic markers, 
when the recombination distance among the markers is known. 

Multi-pomt analysis is advantageous for t^v^o reasons. First, the informativeness of the 
15 pedigree is usually increased. Each pedigree has a certain amount of potential information, 
dependent on the number of parents heterozygous for the marker loci and the number of 
affected individuals in the family. However, few markers are sufficiently polymorphic as to 
be informative in aU those individuals. If multiple markers are considered shnultaneously, 
then the probability of an individual being heterozygous for at least one of the markers is 
20 greatly mcreased. Second, an mdication of the position of the disease gene among the 
markers may be determined. This aUows identification of flanking markers, and thus 
eventually aUows isolation of a small region m which the disease gene resides. Lathrop et 
aU Proc. Natl. Acad. ScL USA, 81:3443-3446 (1984) have written the most widely used 
computer package, LINKAGE, for multi-point analysis. 
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There is a need in the art for identifying the gene associated with a high bone mass 
phenotype. The present invention is directed to this, as well as other, important ends. 

SUMMARY OF THE INVENTION 

The present invention describes the Zmaxl gene and the HBM gene on chromosome 
5 11 ql 3. 3 by gmetic linkage and mutation analysis. The use of additional genetic markers 
linked to the genes has aided this discovery. By using linkage analysis and mutation analysis, 
persons predisposed to Upid associated disorders may be readily identified. Cloning methods 
using Bacterial Artificial Chromosomes have enabled the inventors to focus on the 
chromosome region of 1 lql3.3 and to accelerate the sequencing of the autosomal dominant 

10 gene. In addition, the invention identifies the Zmaxl gene and the HBM gene, and idmtifies 
the guanine-to-thymine polymorphism mutation at position 582 in the Zmaxl gene that 
produces the HBM gene and the HBM phenotype as well as altered lipid levels. 

The present invention identifies the Zmaxl gene and the HBM gene, which can be 
used to determine if people are predisposed to abnormal lipid levels and, therefore, 

15 susceptible to diseases mediated by lipids, including, for example, atherosclerosis, 

arteriosclerosis and associated conditions. Individuals wi^ the iiBM gene have lower LDL, 
triglyceride and VLDL levels and higher HDL levels. In other words, the HBM gene is a 
suppressor of atherosclerosis, arteriosclerosis aud associated conditions. This in vivo 
observation is a strong evidence that treatment of normal individuals with the HBM gene or 

20 protein, or firagments thereof, will ameliorate atherosclerosis^ arteriosclerosis and conditions 
related thereto. 

Moreover, such treatment will be indicated in the treatment of lipid-mediated 
•diseases, particularly arteriosclerosis and conditions related thereto. For example, persons 
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predisposed to elevated Hpid levels (i.e., diabetes, hypercholesteremia and other genetic 
diseases, obesity, male gender, and individuals who smoke) may be identified and/or treated 
by means of the invention. Moreover, the methods and compositions of the invention will be 
of use in the treatment or prevention of diabetic atherosclerotic disease, neurovascular 
5 conditions caused by plaqUe build-up {eig.. stroke), cardiovascular disease, poor circulation 
due to plaque build-up ad associated poor would healing. 

In various embodiments, the present invention is directed to nucleic acids, proteins, 
vectors, and transformed hosts of HBM and Zmaxl . 

AdditionaUy, the present invention is directed to appHcations of the above 
10 embodiments of the invention including, for example, gene therapy, pharmaceutical 
. development, and diagnostic assays for bone development disorders. In preferred 
embodiments, the present invention is directed to methods for treating, diagnosing, 
prevKiting and screaiing for osteoporosis. 

These and other aspects of the present invention are described in more detail below. 

15 RRTITF DESCPTPITTON OF TTTTT. VTGTJRES 

Fig. 1 shows the pedigree of the mdividuals used in. the genetic linkage studies. 
Under each individual is anID number, the z-score for spinal BMD. and the allele calls for 
the critical markers on chromosome 11. SoUd symbols represent "affected" individuals. 
Symbols.containing "N" are "unaffected" individuals. DNA from 37 individuals was 

20 genotyped. Question marks denote unknown genotypes or individuals who were not 

genotyped. 

Fig. 2 depicts the BAC/STS content physical map of the HBM region in llql3.3. 
STS markers derived from genes. ESTs. microsatellites, random sequences, and BAG 
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endsequences are denoted above the long horizontal line. For markers that are present in 
GDB the same nomenclature has been used. Locus names (Dl 1S####) are Usted in 
parentheses after the primary name if available. STSs derived from BAG endsequences are 
listed with tiie BAC name first followed by L or R for the left and right end of the clone, 
respectively. The two large arrows indicate the genetic markers fliat define the HBM critical 
region. The horizontal hnes below the STSs indicate BAG clones identified by PCR-based 
screenmg of a nme-fold coverage BAG library. Open circles indicate fliat the marker did not 
amplify the corresponding BAG Hbraiy address during Hbraiy screening. Glone names use 
the foUowmg convention: B for BAG, the plate, row arid column address, followed by -H 
indicating flie HBM project (i.e., B36F16-H). 

Figs. 3A-3F show the genomic structure of Zmaxl wifli flarikdng intron sequences. 
Translation is mitiated by the underlined "ATG" in exon 1. The site of the polymorphism in 
the HBM gene is in exon 3 and is represented by the underhned "G," whereby this nucleotide 
is a "T" in tiie HBM gene. The 3' untranslated region of the mKNA is underlined witiiin exon 
23 (exon 1, SEQ ID NO:40; exon 2, SEQ ID N0:41; exon 3, SBQ ID NO:42; exon 4, SEQ 
ID NO:43; exon 5, SEQ ID NO:44; exon 6, SEQ ID NO:45; exon 7, SEQ ID NO:46; exon 8, 
SEQ ID NO:47; exon 9, SEQ ID NO:48; exon 10, SEQ ID;srO:49; exon 11, SEQ ID NO:50; 
exon 12. SEQ ID NO:51; exon 13, SEQ ID NO:52; exon 14. SEQ ID NO:53; exon 15, SEQ 
ID NO:54; exon 16, SEQ ID NO:55; exon 17, SEQ ID NO:56; exon 18, SEQ ID NO:57; 
exon 19, SEQ ID NO:58; exon 20, SEQ ID NO:59; exon 21, SEQ ID NO:60; exon 22, SEQ 
ID N0:61; and exon 23; SEQ ID NO:62). 

Fig. 4 shows the domain organization of Zmaxl, including the YWTD spacers, the 
exiraceUular attachment site, the binding site for LDL and calcium, the cysteine-rich growth 
factor repeats, the transmembrane region, the ideal PEST region with the GK-II 
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phosphoiylation site and the internalization domain. Fig. 4 also shows the site of the glycine 
to valine change that occurs in the HBM protein. The signal peptide is located at amino acids 
1-22. the extracellular domain is located at amino acids 23-1385, the transmembrane segment 
is located at amino acids 1386-1413. and the cytoplasmic domain is located at amino acids 
5 1414-1615. 

Fig. 5 is a schematic illustration of the BAG contigs B527D12 and B200E21 in 

■ relation to the HBM gene. 

Figs. 6A-6E are the nucleotide and amino acid sequences of the wild-type gene. 
Zmaxl. The location for the base pair substitiition at nucleotide 582. a guanine to thymine, is 
10 underlined. This alleUc variant is the HBM gene. The HBM gene encodes for a protein wilh 
an amino acid substitution of glycine to valine at position 171. The 5' mitranslated region 
(UTR) boundaries bases 1 to 70, and the 3' UTR boundaries bases 4916-5120. 

Figs. 7A and 7B are northern blot analyses showing the expression of Zmaxl in 

various tissues. 
15 Fig. 8 is a PCR product analysis. 

Fig. 9 is allele specific oUgonucleotide detection of the Zmaxl exon 3 mutation. 
Fig. 10 is the cellular localization of mouse Zmaxl_by in situ hybridization at lOOX 
magnification iising sense and antisense probes. 

Fig. 11 is tiie cellular localization of mouse Zmaxl by in situ hybridization at 400X 
20 magnification using sense and antisense probes. 

Fig. 12 is the cellular localization of mbuse Zmaxl by in situ hybridization of 
osteoblasts in the endosteum at 400X magnification using sense and antisense probes. 
Fig. 13 shows antisense inhibition of Zmaxl expression in MC-3T3 cells. 
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DETATT.IT.P T?ir., sCRIPTTON OF TTTF. T NVEISTION 

To aid in the understanding of the specification and claims, the foUowing definitions 
arepTovided. 

"Gene" refers to a DNA sequence that encodes through its template or messenger 
5 RNA a sequence of amino acids characteristic of a specific peptide. The term "gene" 
includes intervening, non-coding regions, as well as regulatory regions, and can include 5' 
and 3' ends. 

"Gene sequence" refers to a DNA molecule, including both a DNA molecule which 
contains a non-transcribed or non-translated sequence. The term is also intended to include 

10 any combination of gene(s), gene j&agment(s), non-transcribed sequence(s) or non-translated 
sequence(s) which are present on fke same DNA molecule. 

The sequences of the present invention may be derived firom a variety of sources 
including DNA, cDNA, synthetic DNA, synthetic RNA or combinations thereof Such 
sequences may comprise genomic DNA which may or may not include naturally occuning 

15 introns. Moreover, such genomic DNA may be obtained in association with promoter 

regions or poly (A) sequences. The sequences, genomic DNA or cDNA may be obtained in 
any of several ways.' Genomic DNA can be extracted andjurified from suitable cells by 
means well known in the art. Alternatively, mRNA can be isolated from a ceU and used to 
produce cDNA by reverse transcription or other means. 

20 "cDNA" refers to complementary or copy DNA produced from an RNA tar^late by 

the action of RNA-depCTident DNA polymerase (reverse transcriptase). Thus, a "cDNA 
clone" means a duplex DNA sequence complementary to an RNA molecule of interest, 
carried in a cloning vector or PGR amplified. This term includes genes &om which the 
intervening sequences have been removed.' 
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"Reconibinant DNA" means a molecule that has been recombined by in vitro splicing 
cDNA or a genomic DNA sequence. 

"Cloning" refers to the use of zn vitro recombination techniques to insert a particular 
gene or other DNA sequence into a vector molecule. In order to successfully clone a desired 
5 gene, it is necessary to use methods for generating DNA fragments, for joining the fragments 
to vector molecules, for introducing the composite DNA molecule into a host cell in which it 
can repUcate, and for selecting the clone having the target gene from amongst the recipient 
host cells. 

"cDNA library" refers to a collection of recombinant DNA molecules containing 
10 cDNA inserts which together comprise the entire genome of an organism. Such a cDNA 
library can be prepared by methods known to one skiUed in the art and described by, for 
example, CoweU and Austin, "cDNA Library Protocols," Methods in Molecular Biology 
(1997). GeneraUy, KNA is first isolated from the cells of an organism from whose genome it 
is desired to clone a particular gene. 
15 "Cloning vehicle" refers to a plasmid or phage DNA or other DNA sequence which is 

able to replicate in a host cell. The cloning vehicle is characterized by one or more 
endonuclease recognition sites at which such DNA sequences may be cut in a deteraunable 
fashion without loss of an essential biological fimction of the DNA, which may contain a 
marker suitable for use in the identification of transformed cells. 
20 "Expression control sequence" refers to a sequence of nucleotides that control or 

regulate expression of structural genes when operably hnked to those genes. These include, 
for example, the lac systems, the trp system, major operator and promoter regions of the 
phage lambda, the control region of fd coat protein" and other-sequences known to control the 
expression of genes in prokaryotic or eukaryotic cells. Expression control sequences will 
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vary depending on whether the vector is designed to express the operably hnked gene in a 
prokaiyotic or eukaiyotic host, and may contain transcriptional elements such as enhancer 
elements, termination sequences, tissue-specificity elements and/or translational initiation and 
termination sites. 

5 "Expression vehicle" refers to a vehicle or vector similar to a cloning vehicle but 

which is capable of expressing a gene which has been cloned into it. after transformation into 
a host. The cloned gene is usually placed under the control of (i.e., operably linked to) an 
expression control sequence. 

"Operator" refers to a DNA sequence capable of interacting with the specific 
10 repressor, thereby controlling the transcription of adjacent gene(s). 

"Promoter" refers t6 a DNA sequence that can be recognized by an RNA polymerase. 
The presence of such a sequence pennits the RNA polymerase to bind and initiate 
transcription of operably linked gene sequences. 

"Promoter region" is intended to include the promoter as well as other gene sequences 
15 which may be necessary for the initiation of transcription. The presence of a promoter region 
is sufficient to cause the ejq)ression of an operably linked gene sequence. 

"Operably linked" means that the promoter controls the initiation of expression of the 
gene. A promoter is operably linked to a sequence of proximal DNA if upon introduction 
into a host ceU the promoter determines the transcription of the proximal DNA sequence(s) 
20 into one or more species of RNA. A promoter is operably linked to a DNA sequence if the 
promoter is enable of initiating transcription of that DNA sequence. 

"Prokaiyote" refers to all organisms without a true nucleus, mcluding bacteria. 
"Eukaryote" refers to organisms and cells that have a trae nucleus, including 
mammalian ceUs. 
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"Host" includes prokaiyotes and eukaryotes, such as yeast and filamentous fungi, as 
well as plant and animal cells. The term includes an organism or cell that is the recipient of a 
replicable expression vehicle. 

By "animal" is meant to include vertebrates. Preferred vertebrates include mammals 
5 and birds, but also include fish, reptiles and amphibians. Preferred mammals include: 
himians, primates, rodents, canines, felines and livestock. 

"Fragment" of a gene refers to any variant of the gene that possesses the biological 
activity of that gene. 

"Variant" refers to a gene that is substantially s im i l ar in structure and biological 
10 activity or umnunological characteristics to either the eatire gene or to a firagment of the 
gene. Provided tiiat the two genes possess a similar activity, they are considered variant as 
that term is used herein even if the sequence of amino acid residues is not identical 

"AmpUfication of nucleic acids" refers to methods such as polymerase chain reaction 
(PGR), Hgation amphfication (or Hgase chain reaction, LCR) and ampUfication methods 
15 based on the use of Q-beta repHcase. These methods are weU known in the art and described, 
for example, in U.S. Patent Nos. 4,683,195 and 4,683,202. Reagents and hardware for 
conducting PGR are coromerciaUy available. Primers useM for amphfying sequences firom 
the HBM region are preferably complementary to, and hybridize specificaUy to sequences in 
the HBM region or in regions that flank a target region therein. HBM sequences generated 
20 by amplification may be sequenced directly. Alternatively, the amplified sequence(s) may be 
cloned prior to sequence analysis. 

"Antibodies" may refer to polyclonal and/or monoclonal antihodies and fragments 
thereof, and immunologic bmding equivalents thereof, that can bind to the HBM and Zmaxl 
proteins and fragments thereof or to nucleic acid sequences from the HBM or Zmaxl region, 
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particularly firom the HBM locus or a portion thereof. The teim antibody is used both to refer 
to a homogeneous molecular entity, or a mixture such as a serum product made up of a 
pluraUty of different molecular entities. Proteins may be prepared syntheticaUy in a protein 
synthesizer and coupled to a carrier molecule and injected over several months into rabbits. 
Rabbit sera is tested for immunoreactivity to the HBM protein or fragment. Monoclonal 
antibodies may be made by injecting mice with the proteins, or fragments thereof. ' 
Monoclonal antibodies wiU be screened by ELISA and tested for specific immunoreactivity 
with HBM protein or fragments thereof Harlow et al. Antibodies: A Laboratory Manual, 
Cold Spring Harbor Laboratory, Cold Spring Harbor, NY (1988). These antibodies will be 
useful in assays as well as pharmaceuticals. Antibodies can include antibody fragments {e.g.. 
scFv, Fab, F(ab')2, etc) as weU as human antibodies, humanized antibodies aadprimatized 
antibodies. 

"HBM" refers to high bone mass, but polymorphisms associated with J2BAf gene, 
which C2in also be involved in lipid modulation. 

"HBM protein" refers to a protein that is identical to a Zmaxl proteia except that it 
contains an alteration of glycme 171 to valine. An HBM protein is defined for any organism 
that encodes a Zmaxl true homologue. For example, a mouse HBM protein refers to the 
mouse Zmaxl protein having.the glycine 170 to valine substitution. 

"^£BAfgene" refers to the genomic DNA sequence found in individuals showing the 
HBM characteristic or phenotype, where the sequence encodes the protein indicated by SEQ 
ID NO: 4. The HBM gene and the Zmaxl gene are allehc. The protein encoded by the HBM 
gene has the property of causing elevated bone mass and also altering physiologic Upid 
levels, while the protein encoded by the Zmaxl gene does not. The HBM gene and the Zmaxl 
gene differ in that the HBM gens has a thymine at position 582, while the Zmaxl gene has a 
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guanine at position 582. The H3M gene comprises the nucleic acid sequence shown as SEQ 
ID NO: 2. The HBMgeae may also be referred to as an "HBM polymoiphism." 

"Normal," "wild-type," "unaffected" and "Zmaxl " aU refer to the genomic DNA 
sequence that encodes the protein indicated by SEQ ID NO: 3. The Zmaxl gene has a 
5 guanine at position 582. The Zmaxl gene comprises the nucleic acid sequence shown as SEQ 
ID NO: 1. "Normal," "wild-type," "unaffected" and "Zmaxl" also refer to allelic variants of 
the genomic sequence that encodes proteins that do not contribute to elevated bone mass. 
The Zmaxl gene is common in the human population, while the HBM gene is rare. 

"5YWT+EGF" refers to a repeat unit found in the Zmaxl protein, consisting of five 
10 YWT repeats followed by an EGF rq)eat. 

"Bone development" generally refers to any process involved in the change of bone 
over time, including, for example, normal development, changes that occur during disease 
states, and changes that occur during aging. "Bone development disorder" particularly refers 
to any disorders in bone development including, for example, changes that occur during 
15 disease states and changes that occur during aging. Bone development may be progressive or 
cycUcal in nature. Aspects of bone that may change during development include, for 
■ example, minerahzation, formation of specific anatomical futures, and relative or absolute 

nunibers of various cell types. 

"Bone modulation" or "modulation of bone formation" refers to the abiUty to affect 
20 any of the physiological processes involved m bone remodeling, as will be appreciated by one 
skilled in the art, including, for example, bone resorption and appositional bone growth, by, 
inter aha, osteoclastic and osteoblastic activity, and may comprise some or all of bone 
formation and development as used herein. 
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"Normal bone density" refers to a bone density withm two standard devianons oi a ^ 
score of 0. 

By "lipid regulation" or "lipid modulation" is meant the ability to alter by modulating 
the HBM or Zmaxl genes, roRNA or protein encoded thereby the levels of a lipid, Alt^ed 
5 levels of lipid include very low density Hpoproteins (VLDL), low density lipoproteins (LDL), 
high density lipoprotein (HDL) and triglycerides. The regulation or modxilation can be an 
increase or decrease in the lipid level by an agent, which when administered to a subject 
modulates HBM or Zmaxl activity. By "lipid metabohsm" is meant the physiological cycle . 
through which the various triglycerides and lipoproteins proceed. Agents of the invention 
10 can also be said to modulate the metabolism of various lipids. 

"Lipid" preferably includes very low density hpoproteins (VLDL), low density 
hpoproteins (LDL), intemiediate density lipoprotein (CDL), high dmsity Upoprotein (HDL) 
and triglycerides. Lipids can also include apolipoproteins, such as apolipoprotein A-1 (APO 
A-1), apolipoprotein B (APO B), ^ohprotein E (APO E) and lipoproteins such as Upoprotein 
15 a(LIPOa). ' 

By "lipid-mediated disease or condition" is meant to include arteriosclerosis and 
related conditions, hypercholesteremia, hyperhpidemia, atherosclerosis, and conditions or 
lifestyles associated with elevated lipid levels {e.g., diabetes meUitus, smoking and obesity) 
such as those discussed hereiiL 
20 By "arteriosclerosis" is meant to include hypertrophy of the media and subiutimal 

fibrosis with hyaline degeneration which can result in ectasia, aneurysm, increased systolic 
pressure, thrombus formation and embohsm. Disorders associated with arteriosclerosis 
include, but are not limited to, nonatheromatous arteriosclerosis conditions such as: diabetes 
mellitus, chronic renal insufficiency, chronic vitamin D intoxication, pseudoxanthoma 
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elasticxim, idiopathic arterial calcification in infancy, aortic valvular calcification in the 
elderly, and Werner's syndrome. Additional, disorders associated with arteriosclerosis and 
atherosclerosis include: diabetes mellitus, hypertension, familial hypercholesterolemia, 
familial combined hyperlipidemia, familial dysbetaUpoproteinemia, famihal 
5 hypoalphalipoproteinemia, hypothyroidism, cholesterol ester storage disease, systemic lupus 
erythematosus and homocysteinemia. 

By "atherosclerosis" is meant patchy intramural thickening of the subintirna that 
encroaches on the arterial lumen and can cause obstruction. Atherosclerotic plaque consists 
of the accumulation of lipids, ceUs, annective tissue and glycosammoglycans. It can cause 
10 the foUowing conditions: stenosis, thrombosis, aneurysm, or embolus supervenes, as well as 
angina as well as the conditions listed above. 

A "Zmaxl system" refers to a purified protein, cell extract, ceU, animal, human or any 
other composition of matter in which Zmaxl is present in a normal or mutant fonn. 

A "surrogate marker" refers to a diagnostic indication, symptom, sign or other feature 
15 that can be observed in a cell, tissue, human or animal that is correlated with the HBM gene 
or elevated bone mass or both, but that is easier to measure than bone density. The general 
concept of a surrogate marker is well accepted in diagnostiq.medicine. . 

The present invention encompasses the Zmaxl gene and Zmaxl protein in the forms 
indicated by SEQ ED NOS: 1 and 3, respectively, and other closely related variants, as well as 
20 the adjacent chromosomal regions of Zmaxl necessary for its accurate expression, hi a 

preferred embodiment, the present invention is directed to at least 15 contiguous nucleotides 
of the nucleic acid sequence of SEQ ID NO: 1 . 

The present invention also encompasses the HBM gene and HBM protein in the forms 
indicated by SEQ ED NO: 2 and 4, respectively, and other closely related variants, as well as 
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the adjacent chromosomal regioiis of the fiSMgene necessary for its accurate expression. In 
a preferred embodiment, the present invention is directed to at least 15 contiguous 
nucleotides of the nucleic acid sequence of SEQ ID NO: 2. More preferably, the present 
invention is directed to at least 15 contiguous nucleotides of the nucleic acid sequence of 
5 SEQ ID NO: 2. wherein one of the 15 contiguous nucleotides is the thymme at nucleotide 



582. 



The invention also relates to the nucleotide sequence of the ZmaxI gene region, as 
weU as the nucleotide sequence of the BBM gens region. More particularly, a preferred 
embodiment are the BAG clones containing segments of the ZmaxI gene region B200E21-H 
10 and B527DI2-H. A preferred embodiment is the nucleotide sequence of the BAG clones 
consisting of SEQ ID NOS: 5-12. 

The invention also concerns the use of the nucleotide sequence to identify DNA 
probes for the ZmaxI gene and the HBM gcno, PGR primers to amphfy the ZmaxI gene and 
the HgM gene, nucleotide polymorphisms in the Zmaxi gene and the .HBM gene, and- 
15 regulatory elements oftheZTwaxi gene and the .fflSM gene. 

This invention describes the further locahzation of die chromosomal location of the 
ZmaxJ gene and HBM gene on chromosome 1 lql3.3 between genetic markers Dl 1S987 and 
SNP_CONTIG033-6, as weU as the DNA sequences of the Z?naxl gene and the HBM gene. 
The chromosomal location was refined by the addition of more genetic markers to the 
20 mappmg panel used to map the gene, and by the extension of the pedigree to include more 
individuals. The pedigree extension was critical because the new individuals that have been 
genotyped harbor critical recombination events tiiat narrow the region. To identify genes in 
the region on 1 lql3.3, a set of BAG clones containing this chromosomal region was 
identified. The BAG clones served as a tempMe for genomic DNA sequencing, and also as a 



-22 



3NS0OCID: <WO_01928S1A2_I > 



wo 01/92891 



PCTAJSOl/16946 



reagent for identifying coding sequences by direct cDNA selection. Genomic sequencing and 
direct cDNA selection were used to characterize more than 1 .5 milHon base pairs of DNA 
from 1 lql3.3. The Zmaxl gene was identified within this region and the HBM gene was then 
discovered after mutational analysis of affected and unaffected individuals. 
5 When a gene has been genetically localized to a specific chromosomal region, the 

genes in this region can be characterized at the molecular level by a series of steps that 
mclude: cloning of the entire region of DNA in a set of overlapping clones (physical 
mapping), characterization of genes encoded by these clones by a combination of direct 
cDNA selection, exon trapping and DNA sequenciag (gene identification), and identification 
10 of mutations in these genes by comparative DNA sequencing of affected and unaffected 
members of the HBM kindred (mutation analysis). 

Physical mapping is accompUshed by screening libraries of human DNA cloned in 
vectors that are propagated in E. coli or S. cereviseae using PGR assays designed to amplify 
unique molecular landmarks in the chromosomal region of interest To generate a physical 
15 - map of the HBM candidate region, a Hbrary of human DNA cloned in Bacterial Artificial 
Chromosomes (BACs) was screened with a set of Sequence Tagged Site (STS) markers that 
had been previously mapped to chromosome 1 1 ql2-ql3 b^the efforts of the Human Genome 
Project. 

STSs are unique molecular landmarks in the human genome that can be assayed by 
20 PGR. Through the combined efforts of the Human Genome Project, the location of thousands 
of STSs on the twenty-two autosomes and two sex chromosomes has been detennined. For a 
positional cloning effort, the physical map is tied to the genetic map because the markers 
used for genetic mapping can- also be used as STSs for physical mapping. By screening a 
BAG hbrary with a combination of STSs derived firom genetic maikers, genes, and random 
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DNA fragments, a physical map comprised of overlapping clones representing all of the 
DNA m a chromosomal region of interest can be assembled. 

BACs are cloning vectors for large (80 kilobase to 200 Idlobase) segments of hmnan 
or other DNA that are propagated in E. cott. To construct a physical map using BACs, a 
5 Hbrary of BAG clones is screened so that individual clones harboring the DNA sequence 
corresponding to a givea STS or set of STSs are identified. Throughout most of the human 
genome, flie STS markers are spaced approximately 20 to 50 kilobases apart, so that an 
individual BAG clone typically contains at least two STS markers. In addition, the BAG 
libraries that were screened contain enough cloned DNA to cover the human genome six 
10 times over. Therefore, an individual STS typically identifies more than one BAG clone. By 
screening a six-fold coverage BAG hbraiy with a series of STS markers spaced 
approximately 50 kilobases apart, a physical map consisting of a series of overlapping BAG 
clones, i.e. BAG contigs, can be assembled for any region of the human genome. This map is 
closely tied to the genetic map because many of the STS markers used to prepare the physical 
15 map are also genetic markers. 

When constructing a physical map, it often happens that there are gaps in the STS 
map of the genome that result in the inability to identify BAG clones that are overlapping in a 
given location- Typically, the physical map is first constructed from a set of STSs that have 
been identified through the publicly available Hterature and World Wide Web resources. The 
20 initial map consists of several separate BAG contigs that are separated by gaps of unknown 
molecular distance. To identify BAG clones that fill these gaps, it is necessary to develop 
new STS markers fix>m the ends of the clones on either side of the gap. This is done by 
sequencing the terminal 200 to 300 base pairs of the BAGs flanking the g^, and developing 
a PGR assay to amplify a sequence of 100 or more base pairs. If the temiinal sequences are 
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demonstrated to be unique within the human genome, then the new STS can be used to screen 
the BAG library to identify additional B AGs that contain the DNA from the gap in the 
physical map. To assemble a BAG contig that covers a region the size of the HBM candidate 
region (2,000,000 or more base pairs), it is often necessary to develop new STS markers from 

5 the ends of several clones. 

After building a BAG contig, this set of overlapping clones serves as a template for 
- identifydng the genes encoded in the chromosomal region. Gene identification can be 
accomplished by many methods. Three methods are commonly used: (1) a set of BACs 
selected from the BAG contig to represent the entire chromosomal region can be sequenced, 

10 and computational methods can be used to identify all of the genes, (2) the BAGs from the 
BAG contig can be used as a reagent to clone cDNAs corresponding to the genes encoded in 
the region by a method termed direct cDNA selection, or (3) the BAGs from the BAG contig 
can be used to identify coding sequences by selecting for specific DNA sequence motifs in a 
procedure called exon trapping. The present invention includes genes identified by the first 

15 two methods. 

To sequence the entire BAG contig representing the HBM candidate region, a set of 
BAGs was chosen for subcloning into plasmid vectors andjubsequent DNA sequencing of 
these subclones. Since the DNA cloned in liie BACs represents genomic DNA, this 
sequencing is referred to as genomic sequencing to distinguish it from cDNA sequencing. To 
20 initiate the genomic sequencing for a chromosomal region of interest, several non- 
overlapping BAG clones are chosen. DNA for each BAG clone is prepared, and the clones 
are sheared into random small fragments which are subsequently cloned int6 standard 
■ plasmid vectors such as pUG18. The plasmid clones are then grown to propagate the smaller 
fragments, and these are the templates for sequencing. To ensure adequate coverage and 
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sequence quality for the BAG DNA sequence, suflBcient plasmid clones are sequenced to 
yield six-fold coverage of the BAG clone. For example, if the BAG is 100 kilobases long, 
* then phagennds are sequenced to yield 600 kilobases of sequence. Since the BAG DNA was 
randomly sheared prior to cloning in the phagemid vector, the 600 kilobases of raw DNA 
5 sequence can be assembled by computational methods into overlapping DNA sequences 
termed sequence contigs. For the purposes of initial gene identification by computational 
methods, six-fold coverage of each BAG is sufficient to yield ten to twenty sequence contigs 
of 1000 base pairs to 20,000 base pairs. 

The sequencing strategy employed in this invention was to initially sequence "seed" 

10 BAGS from the BAG contig in the HBM candidate region. The sequence of the "seed" BAGs 
was tiien used to identify minimally overlapping BAGs from the contig, and these were 
subsequently sequenced. In this manner, the entire candidate region was sequenced, with 
several small sequence gaps left in each BAG. This sequence served as the template for 
computational gene identification. One method for computational gene identification is to 

15 compare the sequence of BAG contig to publicly available databases of cDNA and genomic 
sequences, e.g. unigene, dbEST, genbank. These comparisons are typically done using the 
BLAST family of computer algorithms and programs (Altschid et al, J. Mot Biol, 215:403- 
410 (1990)). The BAG sequence can also be translated into protein sequence, and the protein 
sequence can be used to search publicly available protein databases, using aversion of 

20 BLAST designed to analyze protein sequences (Altschul ei al, Nucl Acids Res., 25:3389- 
3402 (1997)). Another method is to use computer algorithms such as MZEF (Zhang, Proc. 
Natl Acad. ScU 94:565-568 (1997)) and GRAIL (Uberbacher et al. Methods EnzymoU 
266:259-28 1 (1996)), which predict the location of exons in the sequence based on the 
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presence of specific DSSA sequence motifs that are common to all exons, as weU as tbe 
presence of codon usage typical of human protein encoding sequences. 

In addition to identifying genes by computational methods, genes were also identified 
by direct cDNA selection (Del Mastroef a/., GenomeRes. 5(2):185-194 (1995)). Indirect 
5 cDNA selection, cDNA pools from tissues of interest are prepared, and the BACs from the 
candidate region are used in a liquid hybridization assay to capture the cDNAs which base 
pair to coding regions in the BAG. In the methods described herein, the cDNA pools were 
created from several different tissues by random priming the first strand cDNA from polyA 
KNA, synthesizing the second strand cDNA by standard methods, and adding linkers to the 
10 ends of the cDNA fragments. The hnkers are used to amplify the cDNA pools. The BAG 
clones are used as a template for in vitro DNA synthesis to create a biotin labelled copy of the 
BAG DNA. The biotin labeUed copy of the BAG DNA is then denatured and mcubated with 
an excess of the PGR amplified, linkered cDNA pools which have also been denatured. The 
BAG DNA and cDNA are allowed to anneal in solution, and heteroduplexes between the 
15 BAG and the cDNA are isolated using streptavidin coated magnetic beads. The cDNAs that 
are captured by the BAG are then amphfied using primers complimentary to the linker 
sequences, and the hybridization/selection process is repeated for a second round. After two 
rounds of direct cDNA selection, the cDNA fragments are cloned, and a library of these 
direct selected fragments is created. 
20 The cDNA clones isolated by direct selection are analyzed by two methods. Since a 

pool of BACs from the HBM candidate region is used to provide the genomic DNA 
sequence, the cDNAs must be mapped to individual BACs. This is accomplished by arraying 
the BACs in microtiter dishes, and repUcating their DNA in high density grids. Individual 
cDNA clones are then hybridized to the grid to confirm that they have sequence identity to an 
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individual BAG from the set used for direct selection, and to determine the specific identity 
of that BAG. cDNA clones that are confirmed to correspond to individual B AGs are 
sequenced. To determine whether tbe cDNA clones isolated by direct selection share 
sequence identity or similarity to previously identified genes, the DNA and protein coding 
sequences are conq)ared to publicly available databases using the BLAST family of 
programs. 

The combination of genomic DNA sequence and cDNA sequence provided by BAG 
sequencing aad by direct cDNA selection yields an initial list of putative genes in tbe region. 
The genes ru the region were aU candidates for the HBM locus. To further characterize each 
gene, Northern blots were performed to determine the size of the transcript corresponding to 
each gene, and to detemiine which putative exons were transcribed together to make an 
individual gene. For Northern blot analysis of each gene, probes were prepared from direct 
selected cDNA clones or by PGR amplifying specific fragments from genomic DNA or from 
the BAG encoding the putative gene of interest. The Northern blots gave information on the 
size of the transcript and the tissues in which it was expressed. For transcripts which were 
not highly expressed, it was sometimes necessary to perform a reverse transcription PGR 
assay using KNA from the tissues of interest as a templatejor the reactiorL 

Gene identification by computational methods and by direct cDNA selection provides 
imique information about the genes in a region of a chromosome. When genes are identified, 
then it is possible to examine different individuals for mutations in each gene. 

L Phenotyping using DXA Measurements 

Spinal bone mineral content (BMC) and bone mineral density (BIVID) measurements 
performed at Greighton University (Omaha, Nebraska) were made by DXA using a Norland 
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Instrunients densitometer Q^orland XR2600 Densitometer, Dual Energy X-ray 
Absorptiometry, DXA). Spinal BMC and BMD at other locations used the machinery 
available. There axe estimated to be 800 DXA machines currently operating in the U.S. Most 
' larger cities have offices or imaging centers which have DXA capabiUties, usually a Lunar or 
5 Hologic machine. Each location that provided spine BMC and BMD data included copies of 
the printouts from their machines to provide verification that the regions of interest for 
measurement of BMD have been chosen appropriately. Complete chnical histories and 

skeletal radiographs were obtained. 

The HBM phenotype is defined by the following criteria: very high spinal BMD; a 
10 chnical history devoid of any known high bone mass syndrome; and skeletal radiographs 
showing a normal shape of the appendicular skeleton. 

n. Genotyping of Mici-osatellite Markers ^ 

TO narrow the genetic interval to a region smaller than that originally reported by 
Johnson et a/.. Am. J. Hum, Genet, 60:1326-1332 (1997), additional microsatelHte markers 
15 onchromosomellql2.13weretyped.TheBewmarkersincluded:DllS4191,DllS1883, 

D11S1785, D11S4113. D11S4136. D11S4139, (Dib, et al^Nature, 380:152-154 (1996), 
FGF3 (Polymeropolous. et aL, Nucl. Acid Res., 18:7468 (1990)), as well as 
GTC_HBM_Marker_l, GTC_HBM_Marker_2, GTC_HBM_Marker_3, 
GTC_HBM_Marker_4. GTC_HBM_Marker_5, GTC_HBM_Marker_6, and 
20 GTC_HBM_Maiker_7 (See Fig. 2). 

Blood (20 ml) was drawn mto lavender cap (EDTA containing) tubes by a certified 
phlebotomist. The blood was stored refrigerated until DNA extraction. DNAhasbeen 
extracted from blood stored for up to 7 days m the refrigerator without reduction m the 
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quality or quantity of yield. For those subjects that have blood drawn at distant sites, a 
shipping protocol was successfiiUy used on more than a dozen occasions. Blood samples 
were shipped by overnight express in a styrofoam container with freezer packs to provide 
cooling. Lavender cap tubes were placed on individual plastic shippiag tubes and then into 
5 "zip-lock" biohazard bags. When the samples arrived the next day, they were immediately 
processed to extract DNA. 

The DNA extraction procedure used a kit purchased from Gentra Systems, Inc. 
(Minneapolis, Minnesota). Briefly, the procedure involved adding 3 volumes of a red blood 
cell lysis buffer to the whole blood. After inculbations for 10 minutes at room temperature, 
10 the solution was centrifiiged ia a Beckman tabletop centrifuge at 2,000 X g for 10 minutes. 
The white blood cell pellet was resuspended in Cell Lysis Buffer. Once the pellet was 
completely resuspended and free of cell clumps, the solution was digested witih KNase A for 
1 5 minutes at 37 C. Proteins were precipitated by addition of the provided Protein 
Precipitation Solution and removed by centrifugation. The DNA was precipitated out of the 
15 supernatant by addition of isopropanoL This method was simple and fast, requiring only 1-2 
hours, and allowed for the processing of dozens of samples simultaneously. The yield of 
DNA was routLaely >8 mg for a 20 ml sample of whole blood and had a MW of >50 kb. 
DNA was archived by storing coded 50 /zg aliquots at -80 ''C as an efhanol precipitate. 

DNA was genotyped using one fluorescently labeled oligonucleotide primer and one 
20 unlabeled oligonucleotide primer. Labeled and unlabeled oligonucleotides were obtained 
from Integrated DNA Technologies, Inc. (CoralviUe, Iowa). All other reagents for 
microsatellite genotyping were purchased from Peridn Elmer- Applied Biosystems, Inc. ("PE- 
ABF) (Norwalk, Connecticut). Individual PGR reactions were perfonned for each marker, as 
described by PE-ABI using AmpUTag DNA Polymerase. The reactions were added to 3.5 /A 
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of loading buffer conUdring d=ioniz=d tommmi.. blue dextran and TAMRA 350 size 
^darde (PE-ABI). Afterbeatog a. 95=C for 5 nduutes« denatoe toDNA. ^ samples 
were loaded and electtophoresed as described in to operator's manual for the Model 377 
DNA Sequencer (PE-ABl. Foster OXy. Califonua). After gel electrophoresis, the data was 
5 analyzedusingPE-ABIOENESCA^P'andGENOTYPER™software. First. wifhina. 

GENBSCAN™ softwarc the lane tracldng was manually optimized prior to the first step of 
analysis. After the gellane data was extracted. tl» standard curve piffles ofeach lane were 
ex^aninedandverifiedfortoearityandsizecaffing. Lanes, which had problems with either 

of these parameters, were re-txacked and verifiei Once all lanes we« tracked and fce size 
10 standardswereconectlyideutified.theda.awereimpor.edintoGENOTYPER™forallele 

identification To e^te allele calling (binning), the program Ur*age Designer ftom the 
Internet web-site of Dr. Guy Van Camp Cht.p./al..~.ac.be/u/dnalab/ldJ.tml) was used, 
ms program greatly ^iUtates^ehnportingof datag«reratedby GENOTYPEP.™ into «ae 
pedlgreedrawingprogramCyrilHcCVersionmCherweUScientificPublishingLimi^ 
15 Oxf6rd.GreatBritain)andsubsequer^linkageanalysisusingtheprogramLI^O^ 

(Lathiop etal..Am. J. Hum. Genet, 37:482-498 (1985)). 

J— * 

m. Linkage Analysis 

Fig.ldemonstra.esthepedigreeoftheindivid^usedinfhegeneticlinkagestudies 

forthisinveaticn. Specificany,two^<rfntlinkage analysis was performed usingtheMUNK 
20 andUNKMAPcomponentsofthepro^UNKAGBO-atbrop^a/.,^ J.i/-. Gene... 
37:482-498 (1985)). Pedigree/marker data was exported ftom CyrilUc as a pre-file into the 
Makeped program and converted mto a suitable ped-file for linkage analysis. 
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The original linkage analysis was perfonned using tJiree models: (i) an autosomal 
dominant, fiilly penetrant model, (ii) an autosomal dominant model with reduced penetrance, 
and (iii) a quantitative trait model. The HBM locus was mapped to chromosome llql2-13 
by analyzing DNA for linked markers firom 22 members of a large, extended kindred. A 
5 highly automated technology was used with a panel of 345 fluorescent markers which 
spanned the 22 autosomes at a spacing interval ranging from 6-22 cM. Only markers from 
this region of chromosome 11 showed evidence of hnkage (LOD score ~3,0). The highest 
LOD score (5.74) obtained by two-point and multipoint analysis was Dl 1S987 (map position 
55 in Fig. 2). The 95% confidence interval placed the HBM locus between markers Dl 1S905 
10 and Dl 1 S937 (map position 41-71 in Fig. 2). Haplotype analysis also places the Zmaxl gene . 
in this same region. Further descriptions of the markers D11S987, D11S905, andDllS937 
can be found in Gyapay et ah. Nature Geneticsy Vol. 7, (1994). 

In this invention, the inventors report the narrowing of the HBM interval to the region 
between markers Dl 1S987 and GTC_HBM_Marker_5. These two markers lie between the 
15 delimiting markers from the original analysis (P11S11S905 andDl 1S937} and are 

^proxiihately 3 cM from one another. The narrowing of the interval was accomplished 
using genotypic data from the maikers D11S4191, Dl 1S1883, DllS J785, D11S4113, 
D11S4136, D11S4139, (Dib et aL, Nature, 380:152-154 (1996)), FGF3 (Polymeropolous et 
al.,Nucl. Acid Res., 18:7468 (1990)) (information about the genetic markers can be found at 
20 the internet site of the Genome Database, htlp://gdbwww.gdb.ois/), as well as the maikers 
GTC_HBM_Marker_l, GTC_HBM_Maricer_2, GTC_HBM_Marker_3, 
GTC_HBM_Marker_4, GTC_HBM_Marker_5, GTC_HBM_Marker_6, and 
GTC_HBM_Marker_7. 
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As shown in Fig. 1, haplotype analysis with the above genetic markers identifies 
recombination events (crossovers) in mdividuals 9019 and 9020 that significantly refine the 
interval of chromosome 1 1 to which the Zmaxl gene is localized. Individual 9019 is an 
HBM-affected individual that inherits a portion of chromosome 1 1 firom the matemal 

5 chromosome with the HBM gene, and a portion fi-om .the chromosome 1 1 homologue. The 
portion inherited firom the HBM gene-carrying chromosome includes markers Dl 1S935, 
D11S1313, GTC_HBM_Marker_4, D11S987, D11S1296, GTC_HBM_Marker_6, 
GTC_HBM_Marker_2, Dl 1 S970, GTC_HBM_Marker_3, Dl 1S41 13, 
GTC_HBM_Marker_l, GTC_HBM_Marker_7 and GTC_HBM_Marker_5. The portion 

10 fi-om D 1 1S4136 and continuing in the telomeric direction is derived firom the non-HBM 
chromosome. This data places the Zmaxl gene in a location centromeric to the marker 
GTC_HBM_Marker_5. Individual 9020 is an unaffected individual who also exhibits a 
critical recombination event. This individual inherits a recombinant paternal chromosome 1 1 
that includes markers D11S935, D11S1313, GTC_HBM_Marker_4, D11S987. D11S1296 

15 and GTC_HBM_Marker_6 firom her father's (individual 0115) chromosome 1 1 homologue 
that carries the HBM gene, and markers GTC_HBM_Mari£er_2, Dl 1S970, 
GTC_HBM_Marker_3, GTC_HBM_Marker_l, GTC_HBMJMarker_7, 
GTC_HBM_Marker_5,DllS4136, D11S4139,D11S1314, andDllS937 firom her father's 
chromosome 1 1 that does not carry the HBM gene. Marker D11S41 13 is uninformative due 

20 to its homozygous nature in individual 0115. This recombination event places the 
centromeric boundary of the HBM region between markers Dl 1S1296 and Dl 1S987. 

Two-point linkage analysis was also used to confirm the location of the Zmaxl gene 
on chromosome 1 1. The hnkage results for two point linkage analysis under a model of fijU 
penetrance are presented in Table 1 below. This table hsts the genetic markers in the first 
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column and the recombination fractions across the top of the table. Each cell of the column 
shows the LOD score for an individual marker tested for linkage to the Zmaxl gene at the 
recombination fraction shown in the first row. For example, the peak LOD score of 7.66 
occurs at marker Dl 18970, which is within the iaterval defined by haplotype analysis. 
5 TABLE 1 



Marker 


0.0 


0.05 


0.1 


0.15 


0.2 


0.25 


0.3- 


0.35 


0.4 


D11S935 


- infinity 


0.39 


0.49 


0.47 


0.41 


0.33 


0.25 


0.17 


0.10 


D11S1313 


- infinity 


2.64 


2.86 


2.80 


2.59 


2.30 


1.93 


1.49 


1.00 


D11S987 


- infinity 


5.49 


5.18 


4.70 


4,13 


3,49 


2.79 


2.03 


1.26 


D11S4113 


4.35 


3.99 


3.62 


3.24 


2.83 


2.40 


1.94 


1.46 


0.97 


Dllgl337 


• 2.29 


2.06 


1.81 


1.55 


1.27 


0.99 


0.70 


0.42 


0.18 


D11S970 


7.66 


6.99 


6.29 


5.56 


4.79 


3,99 


3.15 


2.30 


1.44 


D11S4136 


6.34 


5.79 


5.22 


4.61 


3.98 


3.30 


2.59 


1.85 


1.11 


D11S4139 


6.80 


6.28 


5.73 


5.13 


4.50 


3.84 


3.13 


2,38 


1.59 


FGF3 


0.59 


3.23 


3.15 


2.91 


2.61 


2.25 


1.84 


1.40 


0.92 


D11S1314 


6.96 


6.49 


5.94 


5.34 


4.69 


4.01 


3.27 


2.49 


1.67 


D11S937 


-infinity 


4.98 


4.86 . 


4.52 


4.06 


3,51 


2.88 


2.20 


1.47 



A single nucleotide polymorphism (SNP) further defines the HBM region. This SNP 
is termed SNP_Contig033-6 aud is located 25 kb centromeric to the genetic marker 
20 GTCJEIBM_Marker_5. This SNP is telomeric to the genetic marker GTC_HBM_Marker_7. 
SNP_Contig033-6 is present in HBM-affected individual 0113.- HowevCT, the HBM-affected 
individual 9019, who is the son of 0113, does not carry this SNP. Therefore, this indicates 
that the* crossover is centromeric to this SNP. The primer sequence for the genetic markers 
GTC_HBMJVtarker_5 and GTC_HBMJvlarker_7 is shown in Table 2 below. 
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TABLE 2 



Marker 


Primer (Forward) 


Primer (Reverse) 


GTC HBM Marker 5 


TTTTGGGTACACAATTCAGTCG 


AAAACTGTGGGTGCTTCTGG 


GTCJIBM_Marker_7 


GTGATTGAGCCAATCCTGAGA 


TGAGCCAAATAAACCCCTTCT 



5 "The iiidred described have several features of great intere^^^ 

that their bones. while very dense, have an absolutely nonnal shape. The outer dimensions of 
the skeletons of the HBM-affected individuals are normal, and, while medullary cavities are 
present, there is no interference with hematopoiesis. The HBM-affected members seem to be 
resistant to fracture, and there are no neurologic symptoms, and no symptoms of impairment 
10 ofanyorganorsystemfunctioninthemembersexainined. HBM-affected members of the 
kindred hve to advanced age without undue Uhiess or disabiHty. Furthemiore. the HBM 
phenotype matches no other bone disorders such as osteoporosis, osteoporosis pseudogUoma, 
Engehnann's disease. Ribbing's disease, hypeiphosphatasemia. Van Buchem's disease, 
meloAeostosis, osteopetrosis, pycnodysostosis, sclerostenosis. osteopoikilosis, acromegaly, 
15 Pagefs disease, fibrous dysplasia, tubular stenosis, osteogenesis imperfecta,- 

hypoparalhyroidism, pseudohypoparathyroidism, pseudopseudohypoparathyroidism, primary 
and secondary hyperparathyroidism and associated syndr<Jffies, hypercalciuria. meduUary 
carcinoma of the thyroid gland, osteomalacia and other diseases. Qearly, the HBM locus in 
this family has a very powerful and substantial role in regulating bone density, and its 
20 identification is an important step in understanding the pathway(s) that regulate bone density 
and the pathogenesis of diseases such as osteoporosis. 

In addition, older individuals carrying the HBM gene, and therefore expression of the 
HBM protein, do not show loss of bone mass characteristic of normal individuals. Moreover, 
individuals carrying the HBM gene have lower triglycerides, VLDLs, and LDLs and/or 
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increased HDLs. In other words, the HBM gene is a suppressor of osteoporosis and may 
lessen cardiovascular risk arteriosclerotic and/or atherosclerotic associated conditions. In 
essence, individuals carryiag the HBM gene are dosed with the HBM protein, and, as a result, 
lower levels of detrimental lipids {e.g., VLDL, LDL and triglycerides). This in vivo 
5 observation is strong evidence that treatment of normal individuals with the HBM gene or 
protein, or a fragment thereof, will ameliorate osteoporosis and arterio- or atherosclerotic 
conditions or diseases. 

IV, Physical Mapping 

To provide reagents for the cloning and characterization of the HBM locus, the 
10 genetic mapping data described above were used to construct a physical map of the region 
containing Zmaxl on chromosome llql3.3. The physical map consists of an ordered set of 
molecular landmarks, and a set of BAC clones that contain the Zmaxl gene region from 
chromosome llql3.3. 

Various publicly available mapping resources were utilized to identify existing STS 
15 markers (Olson et al. Science^ 245:1434-1435 (1989)) iu the HBM region. Resources 
included the GDB, the Whitehead Institute Genome Center,. dbSTS and dbEST (NCBI), 
lldb, the University of Texas Southwestern GESTEC, the Stanford Human Genome Center, 
and several .literature references (Courseaux et cd.. Genomics, 40:13-23 (1997), Courseaux et 
al,. Genomics, 37:354-365 (1996), Guru et al. Genomics, 42:436-445 (1997), Hosoda et al, 
20 Genes Cells, 2:345-357 (1997), James et al, Nat Genet, 8:70-76 (1994), Kitamura et al, 
DNA Research, 4:281-289 (1997), Lemmens et al.. Genomics, 44:94-100 (1997), Smith et al. 
Genome Res., 7:835-842 (1997)), Maps were integrated manually to identify markers 
mapping to the region containing Zmaxl, 
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Primers for existing STSs were obtained from the GDB or literature references are 
Usted in Table 3 below. Thus, Table 3 shows- the STS markers used to prepare the physical 
map of flie Zmaxl gene region. 



nio.«5nnr.ir> <wo ois2B9ia2_i_: 
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Novel STSs were developed either from publicly available genomic sequence or from 
sequence-derived BAG insert ends. Primers were chosen using a script which automatically 
performs vector and repetitive sequence masking using Cioss_match (P. Green, U. of 
Washington) and subsequent primer picking using Primer3 (Rozeri. Skaletsky (1996. 1997). 
Primers is available at www.genome.wi.mit edu/genome_software/other/primer3.html. 

Polymerase chain reaction (PGR) conditions for each primer pair were initially 
optimized with respect to MgGl. concentration. Hae standard buffer was 10 mM Tris-HCl ^ 
(pH 8.3). 50 mM KCl. MgGl^, 0.2 mM each dNTP, 0.2 each primer. 2.7 ngZ/zl human 
DNA, 0.25 U of AmpHTaq (Peridn Ehner) andMgCl^ concentrations of 1.0 mM, 1.5 mM. 
2.0 mM or 2.4 mM. Gychng conditions included an initial denaturation at 94°G for 2 
minutes followedby 40 cycles at 94-G for 15 seconds. 55-G for 25 seconds, and 72°G for 25 
seconds followed by a final extension at 72-G for 3 minutes. Depending on the results from 
the initial round of optimization the conditions were further optimized if necessary. 
Variables included increasing the amiealing temperature to 58°C or 60°C. increasing the 
cycle number to 42 and the anneahng and extension times to 30 seconds, and using 

AmpliTaqGold (Peridn Ehner). 

BAG clones (Kim a?.. Genomics. 32-.213-218 (1996). Shizuya a/.v^^^ 

Acad. ScL USA. 89:8794-8797 (1992)) containing STS markers of interest were obtamed by 
PGR-based screening of DNA pools from a total human BAG library purchased from 
Research Genetics. DNA pools derived from library plates 1-596 were used corresponding to 
nine genomic equivalents of human DNA. Hie initial screening process involved PGR 
reactions of individual markers against superpools. i.e., a mixture of DNA derived from aU 
BAG clones from eight 384-wenUbrary plates. For each positive supeqjool. plate (8). row 
(16) and column (24) pools were screened to identify a unique library address. PGR products 
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were electrophoresed m 2% agarose gels (iiigmaj contammg U.!) fig/ml etindium bromicie in 
IX TBE at 1 50 volts for 45 min. The electrophoresis units used were the Model A3-1 
systems from Owl Scientific Products. Typically, gels contained 10 tiers of lanes with 50 
wells/tier. Molecular weight markers (100 bp ladder. Life Technologies, Bethesda, MD) 
were loaded at both ends of the gel. Images of the gels were c£5>tured with a Kodak DC40 
CCD camera and processed with Kodak ID software.' The gel data were exported as tab 
delimited text files; names of the files included information about the library screened, the gel 
image files and the marker screened. These data were automatically imported using a 
customized Perl script into Filemaker™ pro (Claris Corp.) databases for data storage and 
analysis. In cases where incomplete or ambiguous clone address information was obtained, 
additional experiments were performed to recover a unique, complete library address. 

Recovery of clonal BAC cultures firom the library involved streaking out a sample 
from the library well onto LB agar (Maniatis et al^ Molecular Cloning: A Laboratory 
Manual, Cold Spring Harbor Laboratory, Cold Spring Harbor, NY (1982)) containmg 12.5 
fig/ml chloramphenicol (Sigma). Two individual colonies and a portion of the initial streak 
quadrant were tested with appropriate STS markers by colony PCR for verificatiorL Positive 
clones were stored in LB broth containing 12.5 jig/ml chloramphenicol and 15% glycerol at - 
TO^'C. 

Several different types of DNA preparation methods were used for isolation of BAC 
DNA. The manual alkaline lysis miniprep protocol listed below (Maniatis et al. Molecular 
Cloning: A Laboratory Manual Cold Spring Harbor Laboratory, Cold Spring Harbor, NY 
(1982)) was successfully used for most applications, Le., restriction mapping, CHEF gel 
analysis, FISH mapping, but was not successfully reproducible in endsequencing. The 
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Autogen and Qiagen protocols were used speciticaUy tor B AU DMA preparation tor 
endsequencing purposes. 

Bacteria were grown in 15 ml Terrific Broth containing 12.5 ng/ml chloramphenicol 
in a 50 ml conical tube at 37°C for 20 hrs with- shaldiig at 300 rpm. The cultures were 
5 centrifugedinaSorvallRT600aDat3000rpm(~1800g)at4°Cforl5nmL The 

supernatant was then aspirated as completely as possible. In some cases cell peUets were 
. frozen at -20''C at this step for up to 2 weeks. The peflet was then vortexed to homogenize 
the cells and minimize clumping. 250 |il of PI solution (50 mM glucose, 15 mM Tris-HCl, 
pH 8, 10 mM EDTA, and 100 ng/ml ENase A) was added and the mixture pipetted up and 
10. down to mix. The mixture was then transferred to a 2 ml Eppendorf tube. 350 pi of P2 

solution (0.2 N NaOH. 1% SDS) was then added, the mixture mixed gently and incubated for 
5 min. at room temperature. 350 nl of P3 solution (3M KOAc, pH 5.5) was added and the 
mixture mixed gently until a white precipitate formed. The solution was incubated on ice for 
5 min. and then centrifiiged at 4*'C in a microfuge for 10 min. The supernatant was 
15 transferred care&Uy (avoiding the white precipitate) to a fresh 2 ml Eppendorf tube, and 0.9 
ml of isopropanol was added, the solution mixed and left on ice for 5 min. The samples were 
centrifuged for 10 min;, and the supernatant removed care&lly. PeUets were washed in 70% 
ethanol and air dried for 5 min. PeUets were resuspended in 200 fil of TE8 (10 mM Tris-HCl. 
pH 8.0, 1.0 mM EDTA), and BNase A (Boehringer Mannhdm) added to 100 fig/ml. 
20 Samples were incubated at 3TC for 30 min., then precipitated by addition of 

CaHjOzNa-SHiO to 0.5 M and 2 volumes of ethanol. Samples were centrifuged for 10 min., 
and the pellets washed with 70% ethanol foUowed by air drying and dissolving in 50 /A TE8. 
Typical yields for this DNAprep were 3-5 ng/15 ml bacterial culture. Ten to 15 pi were used 
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for Hindm restriction analysis; 5 ^1 was used for NotI digestion and clone insert sizing by 
CHEF gel electrophoresis. 

BACs were inoculated into! 5 ml of 2X LB Broth containing 12.5 ^g/ml 
. chloramphenicol in a 50 ml conical tube. 4 tubes were inoculated for each clone, Cultures 
5 were grown overnight H6hr) at 37°C with vigorous shaking (>300ipm). Standard 
conditions for BAG DNA isolation were foUowed as recommended by the Autogen 740 
manufacturer. 3 ml samples of culture were placed into Autogen tubes for a total of 60 ml or 
20 tubes per clone. Samples were dissolved finaUy in 100 jil TE8 with 15 seconds of shaking 
as part of the Autogen protocol. After the Autogen protocol was finished DNA solutions 
10 were transferred from each individual tube and pooled into a 2 ml ^pendoif tube. Tubes 
with large amounts of debris (cany over from the peUeting debris step) were avoided. The 
tubes were theii rinsed with 0.5 ml of TE8 successively and this solution added to the pooled 
material. DNA solutions were stored at 4 "C; clunking tended to occur iq>on freezing at - 
20°C 'n»isDNAwaseitheruseddirectlyforrestrictionm^ping,CaEFgel.analysisor 
15 FISH mapping or was further purified as described below for use in endsequencing reactions. 
The volume of DNA solutions was adjusted to 2 ml with TE8, samples were then 
niixed gently and heated at eS'C for 10 min. The DNA sol^itions were then centrifiiged at 
4''C for 5 min. and the supematants transferred to a 15 ml conical tube. The NaCl 
concentration was then adjusted to 0.75 M (-0.3 ml of 5 M NaCl to the 2 ml sample). The 
20 - total volume was then adjusted to 6 ml with Qiagen column equihbration buffer (Buffer 
QBT). The supernatant containing the DNA was then appHed to the column and allowed to 
enter by gravity floW. Columns were washed twice with 10 mi of Qiagen Buffer QC. Bound 
DNA was then eluted with four sqjarate 1 ml ahquots of Buffer QF kept at 65 °C. DNA was 
precipitated with 0.7 volumes of isopropanol (-2.8 ml). Each sample was then transferred to 
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4 individual 2.2 ml Eppendorf tubes and incubated at room temperature for 2 hr or overnight. 
Samples werecentrifixgedinamicrofugeforlOmin. at 4°C. The supernatant was removed 
carefuUy and 1 ml of 70% ethanol was added. Samples were centrifiiged again and because 
the DNA peUets were often loose at this stage, the supernatant removed carefully. Samples 
5 were centrifixged again to concentrate remaiuing liquid which was removed with a micropipet 
tip. DNA pellets were tiien dried in a desiccator for 10 min. 20 A^l of sterile distilled and 
deionized H,0 was added to each tube which was then placed at 4-C overnight The four 20 
fA samples for each clone were pooled and the tubes rinsed with another 20 ^1 of sterile 
distiUed and deionized H,0 for a final volume of 100 ^1- Samples were then heated at 65 "C 
10 for5min.andthenmixedgently. Typical yields were 2-5 A.g/60 ml culture as assessed by 
NotI digestion and comparison with uncut lambda DNA. 

3 ml of LB Broth containing 12.5 ;zg/ml of chloramphenicol was dispensed into 
autoclavedAutogen tubes. A single tube was used for each clone. For inoculation, glycerol 
stocks were removed from -lO'C storage and placed on dry ice. A small portion of the 
15 glycerol stock was removed from the original tube with a stedle toothpick and transferred 
into the Autogen tube; the toothpick was left in the Autogen tube for at least two minutes 
before discarding. After inoculation the tubes were covered.with tape making sure the seal 
was tight When all samples were inoculated, the tiibe rniits were transferred into an Autogen 
rack holder and placed into a rotary shaker at 37»C for 16-17 hours at 250 rpm. Following 
20 growth, standard conditions for BAG DNA preparation, as definedby tiie manufacturer, were 
used to program the Autogen. Samples were not dissolved in TE8 as part of tiie program and 
DNA pellets were left dry. When tiie program was complete, the tubes were removed from 
the output tray and 30 jil of sterile distilled and deionized H,0 was added directly to the 
bottom of Ihe tube. THe tubes were then gentiy shaken for 2-5 seconds and then covered witii 
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parafilm and incubated at room temperature for 1-3 hours. DNA ssnxpl^ were then 
transferred to an Eppendorf tube and used either directly for sequencing or stored at 4''C for 



later use. 



V. BAG Clone Characterization for Physical Mapping 
5 DNA samples prepared either by manual alkaline lysis or the Autogen protocol were 

digested with Hindin for analysis of restriction fragment sizes. This data were used to 
compare the extent of overlap among clones. Typically 1-2 ng were used for each reaction. 
Reaction mixtures included: IX Buffer 2 (New England Biolabs), 0.1 mg/ml bovine serum 
albmnin (New England Biolabs), 50 ^tg/ml RNase A (Boehringer Mamiheim). and 20 units of 
10 Hindm (New England Biolabs) in a final vohnne of 25 ^1. Digestions were incubated at 
ST-C for 4-6 hours. BAG DNA was also digested with NotI for estimation of insert size by 
GHEF gel analysis (see below). Reaction conditions were identical' to those for BGndlE 
except that 20 units of NotI were used. Six /.I of 6X Ficoll loading buffer containmg 
bromphenol blue and xylene cyanol was added prior to electrophoresis. 
15 Hindm digests were analyzed on 0.6% agarose (Seakem, FMG Bioproducts) in IX 

TBE containing 0.5 fig/ml ethidimn bromide. Gels (20 cm X 25 cm) were electrophoresed in 
a Model A4 electrophoresis unit (Owl Scientific) at 50 volts for 20-24 hrs. Molecular weight 
size markers included undigested lambda DNA. Hindm digested lambda DNA, and Haem 
digested _X174 DNA. Molecular weight markers were heated at 65 °G for 2 min. prior to 

20 .loading the gel. Images were captured with a Kodak DG40 GCD camera and analyzed with. 
Kodak ID software. 

NotI digests were analyzed on a CHEF DRH (BioRad) electrophoresis unit according • 
to the manufacturer's recommendations. Briefly. P/o agarose gels (BioRad pulsed field 
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grade) were prepared in 0.5X TBE, equilibrated for 30 minutes in the electrophoresis unit at 
14°C, and electrophoresed at 6 volts/cm for 14 hrs with circulation. Switching times were 
ramped from 10 sec to 20 sec. Gels were stained after electrophoresis in 0.5 jig/ml ethidium 
bromide. Molecular weight markers included undigested lambda DNA, HmdlH digested 
5 lambda DNA, lambda ladder PFG ladder, and low range PFG marker (all from New England 
Biolabs). 

BAG DNA prepared either by liie manual alkaline lysis or Autogen protocols were 
labeled for HSH analysis using a Bioprime labehng kit (BioRad) according to the 
manufacturer's recommendation with minor modifications. Approximately 200 ng of DNA 
10 was used for each 50 ^ll reaction. 3 ^1 were analyzed on a 2% agarose gel to determine the 
extent of labeling. Reactions were purified using a Sephaldex G50 spin, column prior to in 
situ hybridization. Metaphase FISH was performed as described (Ma et aL, Cytogenei. Cell 
Genet, 74:266-271 (1996)). 

VI. BAC Endsequencing 

15 The sequencing of BAC insert ends utilized DNA prepared by either of the two 

methods described above. The DYEnamic energy transferjjrimers and Dynamic Direct cycle 
sequencing kits fi-om Amersham were used for sequencing reactions. Ready made 
sequencing mix including the.M13 -40 forward sequencing primer was used (Catalog # 
US79730) for ttie T7 BAC vector terminus; ready made sequencing mix (Catalog # 

20 US79530) was mixed with the M13 -28 reverse sequencing primer (Catalog # US79339) for 
the SP6 BAC vector terminus. The sequencmg reaction mixes included one of the four 
fluorescently labeled dye-primers, one of the four dideoxy termination mixes, dNTPs, 
reaction buffer, and Thermosequenase. For each BAC DNA sample, 3 nl of the BAC DNA 
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sample was aJiquoted to 4 PGR strip tubes. 2 ^1 of one of the four dye primer/tennimtion 
mix combinations was then added to each of the four tubes. The tubes were then sealed and 
centrifuged briefly prior to PGR. Thennocycling conditions involved a 1 minute denaturation 
at 95°C, 15 second annealing at 45»C, and extension for 1 minute at 70°C for 35 total cycles. 
5 Ailer cycling the plates were centrifuged briefly to collect all the liquid to the bottom of the 
tubes. 5 Ml of sterile distilled and deionized H^O was then added into each tube, the plates 
sealed and centrifeged briefly again. The four samples for each BAG were then pooled 
together. DNA was then precipitated by adding 1.5^1of7.5MNH40Acand 100/^1 of- • 
20''G lOOo/o ethanol to each tube. Samples were mixed by pipetting up and down once. Ilie 
10 plates were then sealed and incubated on ice for 10 minutes. Plates were centrifuged ui a 
table top Haiaeus centrifuge at 4000 rpm (3,290 xg) for 30 minutes at 4»C to recover the 
DNA The supernatant was removed and excess liquid blotted onto paper towels. Pellets 
were washed by adding 100 ^1 of .20''C 70% ethanol into each tube and recentrifiiging at 
4000 rpm (3,290 xg) for 10 minutes at 4»C. The supernatant was removed and excess Hquid 
15 againremovedbyblottingonap^ertowel. Remaining traces of Uquid were removed by 
placing the plates inside down over a pq)er towel and centrifuging only untfl the centrifuge 
reached 800 rpm. Samples were then air dried at room ten5..erature for 30 min. Tubes were 
c^ed and stored dry at -20»C until electrophoresis. Immediately prior to electrophoresis 
the DNA was dissolved in 1.5 /zl of Amersham loading dye. Plates were then sealed and 
20 centri&gedat2000rpm(825xg). Theplateswerethenvortexedonaplateshakerforl.2 
minutes. Samples were then recentrifiiged at 2000 rpm (825 x^ briefly. Samples were then 
heated at 65''C for 2 min. and immediately placed on ice. Standard gel electrophoresis was 
performed on ABI 377 fluorescent sequencers according to the manufacturer's 
recommendation. 
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V3L Sub-cionmg and Sequencing of HBM BAG DNA 

The physical map of the gene region provides a set of BAG clones that contain 

within them'theZm^xi gene and the HBM gene. DNA sequencing of several of the B AGs 
from the regionhasbeen completed. TheDNA sequence data isaunique reagent that 

5 includesdatathatonesMnedin1heartcan\.etoidentifytheZma.igen^ 

or toprepare probes to identic the gene(s), or to identifyDNAsequencepolymorphisms^t 

identify the gene(s)- 

BAC DNA was isolated according to o«= of two protocols, either a Qiagen 
p^caticn of BAC DNA (Qiagen. Inc. as describ»l in the product literal) or a manual 
10 purificationwhiohisamodifica.ionoftes.andatdalkalinelysis/CesiumCMoride 

p^arationofpUs^dPNA(s..e.g,Auaub.l«ai,Cu^iVo,<^i.toA/ofe<»/- 
Biolos,. Iota Waey & Sons (1997)). Briefly for me manual protocol, cells were peUeted. 

tended in GTE (50 n^ glucose. 25 mM Tri^ (pH 8), 10 mM EDTA) and We 
(50mg/ml solution). foUowedbyNaOH/SDS(l%SDS/0.2NNaOH)anda.enan ice-cold 

15 soIutionof3MKOAc(pH4.5.4.8). KnaseA was added to Are filtered supematan, followed 
byProteinaseKandZOyoSDS. Tie DNA was d»n precipitated with isopropanol.dr.ed and 
^endedinTE(10mMTris.lmMEDTA(pH8.0)). The BAC DNA was tefl^er 
p^ed by Cesium Chloride density gradient centtifugation (Ausubel e, oL. Curren, 

Protocob in Molecular Biology, John Wiley & Sons (1997)). 
20 ' FoUowingi^lation,flieBACDNAwasshear«lhydrodynamicallyusmganHPLC 

Trena. in Bi^,^. SoL. 22:273-274 (1997)) to an insert =ize of 2000-3000 bp. 
■ Aftershearing.^eDNAwasconcen^edandsepaxatedonastandardl%agarosegeL A 

. sin^e ftacdon. corresponding to ^ approximate size, was excised ftom the gel and purified 
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by electroelution (Sambxook et al. Molecular Cloning: A Laboratory ManuaU Cold Spring 
Harbor Laboratory, Cold Spring, NY (1989)). 

The purified DNA fragments were then blunt-ended using T4 DNA polymerase. The 
blunt-ended DNA was tiien ligated to unique BstXI-linker adapters (5'- 
GTCTTCACCACGGGG and 5* GTGGTGAAGAC in 100-1000 fold molar excess); These 
linkers were complimentary to the BstXI-cut pMPX vectors (constructed by the iaventors), 
while the overhang was not self-complimentaiy. Therefore, the linkers would not 
concatemerize nor woxild the cut-vector religate itself easily. The linker-adapted inserts were 
separated firom the unincorporated linkers on a 1% agarose gel and purified using GeneClean 
(BIO 101, Inc.). The linker-adapted insal was tiien ligated to a modified pBlueScript vector 
to construct a "shotgtm" subclone library. The vector contained an out-of-fi:ame lacZ gene at 
the cloning site which became in-firame in tbe event that an adapter-dimer is cloned, allowing 
these to be avoided by their blue-color. 

AU subsequent steps were based on sequencing by ABD77 automated DNA 
sequencing methods. Only major modifications to the protocols are highlighted. Briefly, the 
hbrary was then transformed into DHSa competent cells (Life Technologies, Bethesda, MD, 
DH5a transformation protocol). It was assessed by plating onto antibiotic plates containing 
ampiciUin and IPTG/Xgal. Theplates were incubated overnight at 37° C. Successfial 
transfonnants were then used for plating of clones and picking for sequencing. The cultures 
were grown overnight at 37 °C. DNA was purified using a siUca bead DNA preparation (Ng 
et al, Nucl Acids Res., 24:5045-5047 (1996)) method. In this manner, 25 Mg of DNA was 
obtained per clone. 

These purified DNA samples were then sequenced using ABI dye-terminator 
chemistry. The ABI dye terminator sequence reads were run on ABD77 machines and the 
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^ was directly ^.ferr^ to UNIX macl^es foUcwfag toe tncking of th. s=ls. All reads 
were assen-bledusi^gPHEAP (P.Green. ^stractsofDOBH^Genon^eProgram 

Coa.rao.or-Gra.tee Wot^op V. Jan. 1996. p. 157) deW. parameters and quahty 
scores. The initial assembly was done a. Wold coverage and yielded an av«age of M5 
5 oonti.s FoUowingtteinitialassemb.y.miasing™.es(se,uencesftomclones.ha.only 
gave one suand reads) were identified and se^ wia. ABI «l^U=gy .o allow the 

Primers for walldng were selected usiiig a 
identification of additional overl^mgconngs. Pnmerstorw 

Oenon>el^erapeaticsprogran.Pickjrimernearti«endsof..eclones.ofaciHtat=gap 
dosnre. ,l.ese walks were sequenced using «.e selected clones and prinaers. I>atawere 
10 leassembledwifliPHRAPintosequencecontigs. 

vm Gene Identification by Computational Methods 

FoUowing assen^ly o£*e BAC sequences tato contigs. ti. contigs were subjected to 
computational analyses to identic coding regions and «gions beari^ DKA sequence 

.tailarity to taown genes. IHs protocol included fine following steps. 
15 1 Degapd.econtigs:ti.esequencecontigsofUncon.ainsymbols(deno.edbya 

periodsymbol)thatrepresentlocationswheretheindividnal:ABIse,uencereadshave 

■ Msertions or deletions. Prior to automated computational analysis of the contigs. the petiods 

wereremoveA The original data was maintained &rfetoe«fer.noe. 

w<.re "masked" witbin the sequence by using the 
2 BAG vector sequences were masjtea wmu" 

,0 program'crossmatcb,mGreen.b.tp.Vcbim«a.bi„.ec.wasl^on.eduNW^^ Sincetlte 

„ Ubraries construction detailed above leaves someBAC vector in ti« shotgun 
libraries.tiusprosramwasusedtocompareti>esequenceof*eBACcontigsto.heBAC . 



-53- 



WP 01/92891 



PCTAJSOl/16946 



vector and to mask any vector sequence prior to subsequent steps. Masked sequences were 
marked by an "X" in the sequence files, and remained inert during subsequent analyses. 

3 . E. coli sequences contaminating the BAG sequences were masked by 
comparing the BAG contigs to the entire E. coli DNA sequence. 

4. Repetitive elements known to be common in the human genome were masked 
usmg cross match. In this implementation of crossmatch, the BAG sequence was compared 
to a database of human repetitive elements (Jerzy Jeika, Genetic Information Research 

Institute, Palo Alto. OA). The masked repeats were marked by X and remained inert during " 
subsequent analyses. 

5. The location of exons within the sequmce was predicted using the MZEF 
computer program (Zhang, Proc. Natl Acad. ScL, 94:565-568 (1997)). 

6. The sequence was compared to the publicly available unigsae database 
(National Center for Biotechnology Infonnation, National Library of Medicine, 38A, 8N905, 
8600 Rockville Pike, Bethesda, MD 20894; www.ncbi.nlm.mh.gov) using the blas1n2 
algorithm (Altschul et al, Nucl. Acids Res., 25:3389-3402 (1997)). The parameters for this 
search were: E=0.05, v=50, B=50 (where E is the expected probability score cutoff V is the 
number of database entries returned in the reporting of the jresults, and B is the number of 
sequence aHgnments returned in the reporting of the results (Altschul et al, J. Mol Biol, 
215:403-410(1990)). 

7. . The sequence was translated into protein for all six reading fi^es, and the 
protein sequences were compared to a non-redundant protein.database compiled from 
Genpept Swissprot PIR (National Center for Biotechnology loformation. National Library of 
Medicine, 38A, 8N905, 8600 Rockville Pike. Bethesda, MD 20894; V(nvwaicbijilmauh.gov). 
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rv rxr- \T-^f\ wTicrs E V. Biid B 316 defined as 

The parameters for this search were E=0.05. V-50. B- 50, wHere ii, v . an 

above. 

8. me BAG DNA sequence was compared to database of the cDNA clones 
derived from direct selection experiments (described below) using blastn2. (Altschul et al , 

5 Nucl Acids. Res., 25:3389-3402 (1997)). The parameters for this search were E=0.05, 

V=250, B=250, where E. V, and B are defined as above. 

J t« c#»nneTices of all other BACs from the 

9. The BAG sequence was compared to the sequences ox aii oiu 

HBM region on chromosome llql2-13 usmgblastt2 (Atochul e, al. HucL AM.. JUs.. 
.25:3389-3402 (1997».Thepar3n>.t=.fortos,ard.w«,B=0,05, V-50. B=50.wh=reE.V. 

10 and B are defined as above. 

10 Tl>=BACsequmcew.sc<»np3rcd.omes«Iuencesdcrivedftomtheendsof 

BACsftomfl»HBMregiononchroooso«en,12-13usingblasa2(Al.scW^«I.M.l 

AcUis. iie... 25:33S9.3402 a997)).Thepa«me.«sfor to search v,«cE=O.O5.V=50.B=5O. 

»*ere E, V, and B are defined as above. 
. 11 The BACse<raence was compared to the Genbank database (Natio.=alC«.ter 

rorBio.ecbnolog.Mbrmation.NatianalLibraryofMeaichre.3SA.SN905.S600R^^ 

Pike Bethesd^MD 20894; ™bia>l^"a.gov) -i-8.bUs..^ C^'-" 
AcUs. to.. 253389-3402 (1997)). lieparameters for this search were E=0.05. V^O. B=50. 

where E, V, and B are defined as above. 

12 TheBACsequencewascon^aredtomeSTSdivisionofGeobankdatabase 

(Hatior^ center fbrBiotechnologyIn&,n«tion.NationalLibrary of Medicme. 38A 8N905. 
8600RockvineK^.Be,hesda.MD 20894; www^bUto.nih.gov) nsmg blasb^ (Altschul 
e, al.. 1997). ^ par^eters for Ms search were E=0.05. V=50. B- 50. whereE. V. andB 
are defined as above. 
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13. The BAG sequence was compared to the depressed Sequence (EST) Tag 
Genbank database (National Center for Biotechnology Information, National Library of 
Medicine, 38A, 8N905. 8600 Rockville Pike. Bethesda, MD 20894; www.ncbi.nlnLnili.gov) 
using blastn2 (Altschnl et al, Nucl. Acids. Res., 25:3389-3402 (1997)). The parameters for 
5 this search were E=0.05, V=250, B=250, where E, V, and B are defined as above. 

DL Gene Identification by Direct cDNA Selection 

Piimaiy linkered cDNA pools were prepared from bone marrow, calvarial bone, 
femoral bone, kidney, skeletal muscle, testis and total brain. Poly (A) + RNA was prepared 
from calvarial and femoral bone tissue (Chomczyndd et al. Anal Biochem., 162:156-159 
10 (1987); D'Alessio et al. Focus, 9:1-4 (1987)) and the remainder of the uiRNA was 

purchased from Clontech (Palo Alto, California). In order to generate oUgo(dT) and random 
primed cDNA pools from the same tissue, 2.5 ng niRNA was mixed with oUgo(dT) primer 
in one reaction and 2.5 iig mRNA was mixed with random hexamers in another reaction, and 
bodi were converted to first and second strand cDNA according to manufecturers 
recommendations (Life Technologies, Bethesda, MD). Paired phosphorylated cDNA linkers 
(see sequence below) were annealed together by mixing ig,^ 1:1 ratio (10 /^g each) incubated 
at 65 "C for five minutes and allowed to cool to room temperature. 
Paired linkers oligol/2 

OLIGO 1 : 5'CTG AGC GGA ATT CGT GAG ACC3' (SEQ ID NO:12) 
OLIGO 2: 5TTG GTC TCA CGT ATT CCG CTC GA3' (SEQ ID NO:l 3) 

Paired linkers oligo3/4 

OLIGO 3: 5'CTC GAG AAT TCT GGA TCC TC3' (SEQ ID NO:14) 
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OLIGO 4: S'TTG AGG ATC CAG AAT TCT CGA G3' (SEQ ID N0:15) 

Paired linkers oligo5/6 

OLIGO 5: 5'TGTATGCGAATTCGCTGCGCG3'(SEQIDN0:16) 
OLIGO 6: 5'TrCGCGCAGCGAATrCGCATACA3'(SEQIDNO:17) 

5 Paired linkers oligo7/8 

OLIGO 7: 5'GTC CAC TGA ATT CTC AGT GAG3' (SEQ JD N0:18) 
OLIGO 8: 5'TTG TCA CTG AGA ATT CAG TGG ACS' (SEQ ID NO:19) 

Paired linkers oligol 1/12 

OUGOll: 5'GAATCCGAATrCCTGC3TCAGC3'(SEQIDNO20) 

10 0UG012: 5TrGCrGACCAGGAATTCGQATTC3'(SEQIDKO-^l) 

Liters wer.Hga.ed to all aligofdT) and random pmn«lcDNA pools (s« Mow) according 
to mamifecturers instructions (Life Technologies, Bethesda, MD). 

OUgo 1/2 was Ugat«l to cHgo(dT) and random primed cDNA pools prepared ftom 
bonemarrow. Ollgo 3/4 was ligated to oligo(dn and random primed oDNA pools prepared 
15 fl<imcalvarialbone. OUgo 5/6 was Ugated to oUgo(dl^ and random primed cDNA pools 
preparedftombrainz^sJcetataln^le. Oligo 7/8 was iigated to oUgo(dTO and random 
primed cDNA pools prepared fiom Hdney. Oligo 11/12 was ligated to oUgo(dI) and random 
primed oDNA pools prepared fiom femoral bone. 

The CDNA pooU were evaluated for leng* dishibudon by PGR amplification using 1 
20 ^ofal:l,l:10.andl:lOO<mudonoftheUgationreaction.respecttvdy.PCB.reacfions 

were performed in a Perkin Ehner 9600, each 25 ^1 volume reaction contained 1 m of DNA, 
10 mMTris-HCl (pH 8.3), 50 mMKCl, 1.5 mMMgC12. 0.001% gelatin, 200 mM each 



57- 



wo 01/92891 

PCT/USOl/16946 



dNTPs, 10 AtM primer and 1 unit Taq DNA polymerase (Peridn Elmer) and was amplified 
under the following conditions: 30 seconds at 94'C, 30 seconds at 60°C and 2 minutes at 
72°C for 30 cycles. The length distribution of the ampUfied cDNA pools were evaluated by 
electrophoresis on a 1% agarose geL The PGR reaction that gave the best representation of 
5 the random primed and oligo(dT) primed cDNA pools was scaled up so that -2-3 ng of each 
cDNA pool was produced. The starting cDNA for the direct selection reaction comprised of 
0.5 iig of random primed cDNAs mixed with 0.5 ng of oligo(dT) primed cDNAs. 

The DNA from the 54 BACs that were used in the direct cDNA selection procedure 
was isolated using Nucleobond AX columns as described by the manufecturer (The Nest 
10 Group, Lie). 

The BACs were pooled in equimolar amounts and 1 ^g of the isolated gaiomic DNA 
was labelled with biotin 16-UTP by nick translation in accordance with the manufecturers 
instructions (Bodmnger Mannheim). The incorporation of the biotin was monitored by 
methods that could be practiced by one skilled in the art (Del Mastro and Lovett, Methods in 

15 Molecular Biology, Humana Press Inc., NJ (1996)). 

Direct cDNA selection was performed using metiiods liiat could be practiced by one 
skilled in the art (Del Mastro and Lovett, Methods in Molecular Biology, Humana Press Inc., 
NJ (1996)). Briefly, the cDNA pools were multq>le!ced in two separate reactions: In one 
reaction cDNA pools from bone marrow, calvarial bone, brain and testis were mixed, and in 

20 the other cDNA pools from skeletal muscle, kidney and femoral bone were mixed. 

Suppression of the repeats, yeast sequences and plasnrid in the cDNA pools was performed to 
a Cot of 20. 100 ng of biotinylated BAG DNA was mixed with Ihe siq)pressed cDNAs and 
hybridized in solution to a Cot of 200. The biotinylated DNA and the cognate cDNAs was • 
c^tured on streptavidin-coated paramagnetic beads. The beads were washed and the primary 



-58- 



PCT/USOl/16946 

WO 01/92891 



1 * J Tu^oo,.-nMA 5 were PGR amplified and a second round of 
selected cDNAs were eluted. These cDNAs were ri^x^ omt. 

direct selection was perfomed. Tie product of toe second round of direct selection is 
referred to as the secondary selected ntaterial. A Galanin cDNA done, previously sho«n to 

^ • iQ.Aa-j 4.77 n 993^") was iised to monitor eniichment 
map to llql2-13 (Evans, Genomics, 18:473-47/ (.iyy:>;;, u 

5 during the two rounds of selection. 

ne secondary selected material torn bone mam>w, calvarial Bone, femoral bone. 
Hdney.skeletal muscle, testis and totalbrain was PCRsmplifiedusingmodifiedprimers of 

oUgos 1. 3. 5. 7 and U. shownbelow. and cloned into the TOG vector pAMPlO (Life 
Technologies. Bethesda. MD). in accordance witb the manuS^ture^s recommendations. 

10 Modified plimex sequences: 

OUgol-COA: S^CUACUACUACUACTG AGC GGAATTCOT GAG ACC3. (SEQID 
NO:22) 

OUgo3-CUA:5.CUACUACUACUACTCGAGAATTCTGGATCCTC3. (SEQID 

KO:23) 

15 OUgoS^A: 5.CUACUACUACUATGTATGCGAATTCGCTGCGCG3.(SBQID 
TSfO:24) 

OUgo7-CUA:5.CUACUACUACUAGTCCACTGAATICICAGTGAGnSEQm 
KO:25) 

OUgon-CO^ 5-CUA CUA CUA C0A GAA TCC GAa'tTC CTG GTC AGC3' (SEQ n> 
20 NO:26) 

ne cloned secondary selected material. &om each tissue source, was tiansfomred into 
MAX Efficiency DH5a Competent Cdls CUfe Technologies. Befhesda. MD) as 
■ recommended by the manufacU^rer. 3S4 colonies were picked ftom «.h formed source 
and arrayed into four 96 weU microtiter plates. 
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All secondarily selected cDNA clones were sequenced using Ml 3 dye primer 
terminator cycle sequencing kit (AppUed Biosystems), and the data collected by the ABI 377 
automated fluorescence sequencer (Applied Biosystems). 

All sequences were analyzed using the BLASTN, BLASTX and FASTA programs 
(Altschul et aL, J. Mol Biol, 215:403-410 (1990), Altschul et aL, Nucl. Acids. Res., 25:3389- 
3402 (1997)). The cDNA sequences were compared to a database containing sequences 
derived from human repeats, mitochondrial DNA, ribosomal RNA, E. coli DNA to remove 
background clones from the dataset using flie program cross_match. A further round of 
comparison was also performed using the program BLASTN2 against known genes 
(Genbank) and the BAG sequences from the HBM region. Those cDNAs that were >90% 
homologous to these sequences were filed according to the result and the data stored in a 
database for further analysis. cDNA sequences that were identified but did not have 
significant similarity to the BAG sequences from the HBM region or were ehminated by 
cross_match were hybridized to nylon membranes v^daich contained the BACs from the HBM 
region, to ascertain whether they hybridized to the target. 

Hybridization analysis was used to m^ flie cDNA clones to the BAG target that 
selected them. The BAGs that were identified from the HBM region were arrayed and grown 
into a 96 well microtiter plate. LB agar containing 25 \ig/va!L kanamydn was poured into 96 
well microtiter plate Uds. Once the agar had soKdified, pre-cut Hybond N+ nylon membranes 
(Amersham) were laid on top of the agar and the BACs were stamped onto the membranes in 
duplicate using a hand held 96 well rq)lica plater (V&P Scientific, Inc.). The plates were 
incubated overnight at 37°C. The membranes were processed according to the manufecturers 
recommendations. 
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The cDNAs that needed to be mapped by hybridization were PGR amplified using the 
relevant primer (oligos 1. 3, 5. 7 and 11) that would amplify that clone. For this PGR 
amphfication, the primers were modified to contain a linkered digoxigenin molecule at the 5' 
of the oUgonucleotide. lUe PGR amphfication was performed ^der the same conditions as 
5 describedinPrepaxationofcDNAPoolsCabove). The PGR products were evaluated for 
quahty and quantity by electrophoresis on a 1% agarose gel by loading 5 jxl of the PGR 

reaction. lUe nylon membranes containing the stamped BACs were individuaUy pre- 
hybridized in 50 ml conical tubes containing 10 ml of hybridization solution (5x SSPE, 0.5x 
Blotto, 2.5% SDS andlmMEDTA(pH8.0)). The 50 ml conical tubes were placed ma 

10 rotisserieoven(RobbinsScientific)for2hoursat65-G.25ngofeachcDNAprobewas 
denatured and added into individual 50 ml conical tubes containing the nylon membrane and 
hybridization solution. The hybridization was performed overnight at 65 ° G . IHe filters were 
washed for 20 minutes at 65»G in each of the following solutions: 3x SSPE. 0.1% SDS; Ix 
SSPE, 0.1% SDS and O.lx SSPE, 0.1% SDS. 

15 ' IHe membranes were removed ftom the 50 ml conical tubes and placed in a dish. 

Acetate sheets wereplacedbetweeneachmembrane to prevent them from sticking to each 

other. TlieincubationofthemembraneswiththeAnti.piG-APandGDP-Starwasp^^^^ 
accordingtomanufacturersrecommendationsCBoehringerMan^^ Tl.e membranes were 
capped in Saran wrap and exposed to Kodak Bio-Max X-ray film for 1 hour. 

20 X. cDNAaonmg and Expression Analysis 

To characterize the expression of the genes identified by direct cDNA selection and 
genomic DNA sequencing in comparison to the pubhcly available databases, a series of . 
e^erimentswereperfonnedto furlhercharacterize fixe genes in theHBMregi^^^ FirsV 
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oligonucleotide primers were designed for use in the polymerase chain reaction (PGR) so that 
portions of a cDNA, EST, or genomic DNA could be amplified from a pool of DNA 
molecules (a cDNA Hbrary) or RNA population (RT-PCR and RACE). The PGR primers 
were used in a reaction containing genomic DNA to verify that they generated a product of 
the size predicted based on the genomic (BAG) sequence. A nmnber of cDNA Ubraries were 
tiien examined for the presence of the specific cDNA or EST. The presence of a fragment of 
a transciiptidn mut in a particular cDNA Kbraiy mdicates a high piobabiUty that additional 
portions of the same transcription unit will be present as well. 

A critical piece of data that is required when characterizing novel genes is the length, 
in nucleotides, of the processed transcript or messenger RNA (inKNA). One skiUed in the art 
primarily determines the length of an mRNA by Northern blot hybridization (Sambrook et 
al.. Molecular Cloning: A Laboratory Manualy Cold Spring Harbor Laboratory, Cold Spring 
Harbor NY (1989)). Groups of ESTs and direct-selected cDNA clones that displayed 
significant sequence similarity to sequenced BACs in the critical region were grouped for 
convenience into ^rqximately 30 kilobase units. Within each 30 kilobase unit there were ' 
from one iqj to fifty ESTs and direct-selected cDNA cloiies which comprised one or more 
independent transcription units. One or more ESTs or dir^trselected cDNAs were used as 
hybridization probes to determine the length of flie mRNA in a variety of tissues, usmg 
commerciany available reagents (Multiple Tissue Northern blot; Qontech, Palo Alto, 
California) urider conditions recommended by the manufiactuTCT. 

Directionally cloned dDNA libraries fixm femoral bone, and calvaiial bone 
tissue were constructed by methods familiar to one skilled in the art (for example, Soares in 
Automated DNA Sequencing and Analysis, Adams, Fields and Venta:, Eds., Academic Press,- 
pages 1 10-1 14 (1994)). Bones were initially brokai into fragments with a hammer, and 
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the small pieces were frozeninUquidrutrogen and reduced toapowdermatissuepu^^^^^ 
(Spectrum Laboratory Products). RNA was extracted from the powdered bone by 
homogenizing the powdered bone with a standard Acid Guanidinimn 
Thiocyanate-Phenol-Chlorofoim extraction buffer (e.g. Chomczynski and Sacchi. Anal 
5^ocW, 162:156-159 (1987))usingapolytronhomogemzer(Brin3ananIns^^^ 

Additionally, human brain and lung total RNA was purchased from Clontech. PolyA BNA 
was isolated from total TGS[A using dynabeads-dT according to the manufacturer's 

recominendations (Dynal, Inc.). 

Fiist ^ cDNA symhesis was faitiatrf using an oUgonucleotito primer wift the 

sequent: 5-.AAcr<3GAAGAATrcQC£K5££22eAGGAATrrrrrrrrT^^ 

(SEQ ID NO:27). Tlis piimrr introduces a Nofl restriction site (underlined) atfte 3' end of 
to CDNA. First a^d second strand syndesis were perfonned using the "one-tube" cDNA 
synthesis kit as describedbythe anufac«r.r(Ufe Technologies. Bethesda. MD). Double 
s,«>ded cDNAs were treated with T4 polynucleotide Idnase to ensure that the ends of &e 
molecules were blunt (Scares. taAu,oma,edDm Seguencing ondAmfy^, Adams. Fields 
and Venter. Eds., Academic Press. NY. pages 110-U4 (1994)). and the blunt ended cDNAs 
werethensizeselectedby aBicgelcolumn(Huynh.r.;..to.X.W^ C/omng. Vol. 1. Glover. 
E4. IRL Press. Otford, pages 49-78 (1985)) or wifli a size-sep 400 sepharose column 
(Pharmacia, catalog # 27-5105-01). Only cDNAs of 400 base pairs or longer were used in 
: subse<pen. steps. EcoRI adapters (se<^ence: 5' OH-AATTCGGCACGAG-OH r (SEQ ID 
NO:28). and 5' p-CTCQTGCCG-OH 3' (SEQ ID NO:29)) were then Ugated to the double 
strandel CDNAS bymethodsfemiliartooncsldlledin the arKSoaxes. 1994). IheEcoEI 
adapters werettenremoved fiomthe 3' end of to cDNAby digesHon with Nod (Scares. 
1994) n>e CDNA was fl>en Ugated mto the plasmid vector pBluescipt H KS^ (S^ene. 
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La JoUa, California), and the ligated material was transformed into coli host DHIUJB or 
DH12S by electroporation methods familiar to one skilled in the art (Scares, 1994). After 
growth overnight at 31 ^C, DNA was recovered from the E, coli colonies after scraping the 
plates by processing as directed for the Mega-prep kit (Qiagen, Chatsworth, California). The 
5 quality of the cDNA libraries was estimated by coimting a portion of the total numbers of 
"primary transfomiants and determining the average insert size and the percentage of plasmids 
with no cDNA insert Additional cDNA libraries (human total brain, heart, kidney, 
leukocyte, and fetal brain) were purchased from Life Technologies, Bethesda, MD. 

cDNA hbraries, both oligo (dT) and random hexamer (N^) primed, were used for 
10 isolating cDNA clones transcribed within the HBM region: human bone, himian brain, human 
kidney and human skeletal muscle (all cDNA libraries were made by the inventors, except for 
skeletal muscle (dT) and kidney (dT) cDNA Hbraries). Four 10 x 10 arrays of each of the 
cDNA libraries were pr^ared as follows: the cDNA libraries were titered to 2.5 x 10* using 
primary transfomiants. The appropriate volume of frozen stock was used to inoculate 2 L of 
15 LB/ampicillin (100 mg/ml). This inoculated liquid culture was aliquotted into 400 tubes of 4 
ml each. Each tube contained approximately 5000 cfii. Thetubes were incubated at 30 °C 
overnight with gentle agitation. The cultures were grown to an OD of 0.7-0.9. Frozen stocks 
were prepared for each of the cultures by ahquotting 100 (A of culture and 300 fA of 80% 
glycerol. Stocks were frozen in a dry ice/eflianol bath and stored at -70 **C. The remaining 
20 culture was DNA prepared using the Qiagen (Chatsworth, CA) spin miniprep kit according to 
the manufacturer's instructions. The DNAs from the 400 cultures were pooled to make 80 
column and row pools. The cDNA libraries were detennined to contain HBM cDNA clones 
of interest by PGR. Markers were designed to amplify putative exons. Once a standard PGR. 
optimization was performed and specific cDNA libraries were detCTnined to contain cDNA 
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Clones of interest, the markers were used to screen the arrayed library. Positive addresses 
indicating the presence of cDNA clones were confirmed by a second PGR using the same 
markers. 

Once a cDNA Hbrary was identified as likely to contain cDNA clones corresponding 
5 to a specific transcript ot interest from the HBM region, it was manipulated to isolate the 
clone or clones containing cDNA inserts identical to flie EST or direct-selected cDNA of 
interest. Uris was accompUshed by a modification of the standard "colony screening- 
method (Sambrook et al. Molecular Cloning: A Laboratory Manual, Cold Spring Harbor 
Laboratory. Cold Spring Harbor NY (1989)). Specifically, twenty 150 mm LB+ampicillin 
10 agarplateswerespreaavnth20,000colonyfomnngunits(cfii)ofcDNAHbraryandthe 

colonies allowed to grow overnight at 37-C. Colonies were transferred to nylon filters 
(Hybond from Amersham, or equivalent) and duplicates prepared by pressing two filters 
together essentially as des«ibed (Sambrook al. Molecular Cloning: A Laboratory Manual, 
ColdSpringHarborLaboratory.ColdSpiingHafborNY(1989)). lUe "master" plate was 
15 then inc^ated an additional 6-8 hours to allow the colonies to grow back. THe DNA firom 
the bacterial colonies was &en affixed to the nylon filters by treating the filters sequentially 
with denaturing solution (0.5 NNaOH, 1.5 MNaCl) for twq minutes, neutralization solution 

(0.5 M Tris-Cl pH 8.0, 1.5 M NaCl) for two minutes (twice). The bacterial colonies were 
removed from the filters by washing inasolution of 2XSSC/0.1%SDS for one minute while 

20 mbbing with tissue paper. The filters were air dried and baked under vacuum at 80°C for 1-2 
hoturs. 

A cDNA hybridization probe was prepared by random hexamer labeling (Fineberg 
and Vogelstein, A^ial. Biochem., 132:6-13 (1983)) or by including gene-specific primers and 
norandomhexamersinthereaction(forsmallfragments). Specific activity was calculated 
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and was >5X10^ q)m/10^ of cDNA. The colony membranes were then prewashed m 10 
mM Tris-Cl pH 8.0, 1 M NaCl, 1 mM EDTA, 0.1% SDS for 30 minutes at 55 Following 
the prewash, the filters were prehybridized in > 2 ml/filter of 6X SSC, 50 % deionized 
formamide, 2% SDS, 5X Denhardfs solution, and 100 mg/ml denatured sahnon spenn DNA, 
at 42 °C for 30 minutes. The filters were then transferred to hybridization solution (6X SSC, 
2% SDS, 5X Denhardf s, 100 mg/ml denatured salmon spenn DNA) containing denatured 
a^^-dCTP-labelled cDNA probe and incubated at 42 °C for 16-18 hours. 

After the 16-1 8 hour incubation, the filters were washed under constant agitation in 
2X SSC, 2% SDS at room temperature for 20 minutes, followed by two washes at 65**C for 
15 minutes each. A second wash was performed in 0,5 X SSC, 0.5% SDS for 15 minutes at 
65° C. Filtars were then wrapped in plastic wrap and exposed to radiographic fihn for several 
hours to overnight. After film development, individual colonies on plates were aligned with 
the autoradiogr^h so that they could be picked into a 1 ml solution of LB Broth containing 
ampiciUin. After shaking at BT^'C for 1-2 hours, aliquots of the solution were plated on 150 
mm plates for secondaiy screening. Secondaiy screening was identical to primary screening 
(above) except that it was performed on plates containing ~250 colonies so that individual 
colonies could be clearly identified for picking. 

After colony screening with radiolabeled probes yielded cDNA clones, the clones 
were characterized by restriction endonuclease cleavage, PGR, and direct sequencing to 
confirm the sequence identity between the original probe and the isolated clone. To obtain 
the ftdl-length cDNA, the novel sequence from the end of the clone identified was used to 
probe the library again. This process was repeated until the length of the cDNA cloned 
matches that estimated to be fiiU-length by the northern blot analysis. 
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RT-PCR was used as another method to isolate Ml length clones. The cDNA was 
synthesized and amplified using a "Superscript One Step RT-PCR" kit (Life Technologies, 
Gaithersburg, MB). Th. procedure involved adding 1 .5 ng of BNA to the following: 25 ^tl of 
reaction mix provided wMchisaproprietary buffer mix withMgSO.anddl^P's,l^xl sense 

5 primer (10 pM) and 1^1 anti-sense primer (10 ^M). 1 l^l reverse transcriptase and Taq DNA 
polymerase mix provided and autoclavedwaterto a total reaction mix of 50 ^1. TT.e reaction 
was then placed in a thexmocycler for 1 cycle.at 50°C for 15 to 30 minutes, then 94«>C for 15 
seconds, 55-60°C for 30 seconds and 68-72°Cfor 1 minute per kilobase of anticipated 
product and finally 1 cycle of 72'C for 5-10 minutes. TTie san^le was analyzed on an 
10 agarosegel. Tlxe product was excised from the gel and purified from the gel (GeneClean, Bio 
101). nie purified product was cloned in pCim (General Contractor DNA Cloning 
System. 5 Prime - 3 Prime, Inc.) and sequenced to verify that flie clone was specific to the 
gene of interest 

Rapid Amplificalion of cDNA raids (RACE) was perfcmed foUowing the 
15 r^^^^. tastmctions using a Marathon cDNA Aniplification Kit (aontech. Palo AlW. 
CA) as a msttod for cloning the 5' and 3' ends of candidate genes. cDNA pools were 
prepared ftom total ENA by pe*tnnng first strand synfl.^. where a sample of totalRNA 

■ san^le was mixed with a modified oUgo (dT) primer, heated to 70-C. cooled on ice and 

■ Mowedby the addition ofi 5x first strand buffer. 10 mM dNlP mix. and AMV Reverse 

20 Transcriptase (20 U/Ml). -n-e tobe was incubated at 42-C for one hour and flteu the reacdon 
na,e was placed on ice. For second strand synthesis, the followmg compon«>ts were added 
to flie reacdon tube: 5x second sttandbuffer. 10 mM dNTP mix. stetUe water. 20x 
. seconds.randenzymecock.ailandthereactionnabewasincubatedatl6-Cferl.5bours. T4 

DNAPoIymerase was added to the reaction Bibe and incubated at 16»C for 45 minutes. The 
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second-strand synthesis was tenninated with the addition of an EDTA/Glycogen mix. The 
sample was subjected to aphenol/chlorofomi extraction and an ammonium acetate 
precipitation. The cDNA pools were checked for quahty by analyzing on an agarose gel for 
size distribution- Marathon cDNA adapters (Clontech) were then hgated onto the cDNA 
5 ends. The specific adapters contained priming sites that allowed for amplification of either 5^ 
or 3' ends, depending on the orientation of the gene specific primer (GSP) that was chosen. 
An aliquot of the double stranded cDNA was added to the following reagents: 1 0 iXSs/L 
Marathon cDNA adapter, 5x DNA ligation buffer, T4 DNA ligase. The reaction was 
incubated at 16°C overnight The reaction was heat inactivated to terminate the reaction. 

10 PCR was performed by the addition of the following to the diluted double stranded cDNA 
' pool: lOx cDNA PCR reaction buffer, 10 pM dNTP mix, 10 /^M GSP, 10 ^^M API primer 
(kit), 50x Advantage cDNA Polymerase Mix. Thermal Cycling conditions were 94°C for 30 
seconds, 5 cycles of 94''C for 5 seconds, 72**C for 4 minutes, 5 cycles of 94'*C for 5 seconds, 
TO^'C for 4 minutes, 23 cycles of 94^C for 5 seconds, 68°C for 4 minutes. After the first 

15 round of PCR was performed using the GSP. to extend to the end of the adapter to create the 
adapter primer binding site, e3q>onential antiplification of the specific cDNA of interest was 
observed- Usually a second nested PCR is performed to confirm the specific cDNA, The 
RACE product was analyzed on an agarose gel and then excised and purified fiom the gel 
(GeneClean, BIO 101). The RACE product was then cloned into pCTNR (Gen^ 

20 Contractor DNA Cloning System, 5' - 3', Inc.) and the DNA sequence determined to verify 
that the clone is specific to the gene of interest 
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XI. Mutation Analysis 

Comparative genes were identified using the above procedures and the exons from 
each gene v.ere subjected to mutation detection analysis. Comparative DNA sequencing was 
used to identify polymorphisms in HBM candidate genes from chromosome llql2.13. DNA 
5 sequences for candidate genes were ampUfied from patient lymphoblastoid cell lines, 

The inventors developed a method based on analysis of direct DNA sequencing of 
PGR products amplified from candidate regions to search for the causative polymorphism. 
n.e procedure consisted of three stages that used different subsets of HBM family to find 
segregating polymorphisms and a population panel to assess the frequency of the 
10 polymorphisms. Hxe family resources result from a single founder leading to the assumption 
that all affected individuals will share the same causative polymorphism. 

Candidate regions were first screened in a subset of the HBM family consisting of the 
proband,daughter.andhermother.fatoandbrother. Monochromosomal reference 
sequences were produced concurrently and used for comparison. The mother and daughter 
15 carried the HBM polymorphism in this nuclear family, providing the abiUty to monitor 
polymorphism-transmission. The net result is that two HBM chromosomes and six non- 
HBM chromosomes were screened. Hus allowed exclusion, of numerous frequent alleles. 
■ Only alleles exclusively present in the affected individuals passed to the next level of 
analysis. 

20 Polymorphisms that segregated exclusively with flie HBM phenotype in this original 

familywerethenre-examinedinanextendedportionoftheHBMpedigreeconsistm^ 
additionalnuclearfemilies. ITxese families consisted of five HBM and three unaffected 
individuals. The HBM individuals in this group included the two critical crossover 
mdividuals, providing the centromeric and telomeric boundaries of the critical region. 
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TracJdng the heredity of polymoiphisms between these individuals and their affected parents 
allowed for further rejBning of the critical region. This group brought the total of HBM 
chromosomes screened to seven and the total of non-HBM chromosomes to seventeen. 

When a given polymorphism continued to segregate exclusively with the HBM 
phenotype in the extended group, a population panel was then examined. This panel of 84 
persons consisted of 42 individuals known to have normal bone mineral density and 42 
individuals known to be unrelated but with untyped bone mineral density. Normal bone 
mineral density is within two standard deviations of BMD Z score 0. The second group was 
from the widely used CEPH panel of individuals. Any segregating polymorphisms found to 
be rare in this population were subsequently examined on the entire HBM pedigree and a 
larger population. 

Polymerase chain reaction CPCR) was used to generate sequencing templates &om. the 
HBM family's DNA and monochiomosomal controls. Enzymatic amplification of genes ■ 
within the HBM region on llql2-13 was .accomplished using the PGR with ohgonucleotides 
flan k ing each exon as well as the putative 5' regulatory elanents of each gene. The primers 
were chosen to amplify each exon as well as 15 or more base pairs within each intron on 
eidier side ofihe splice. All PGR primers were made as chimeras to facilitate dye primer 
sequencing. The M13-21F (5'- GTA A CGA CGG CCA GT -3') (SEQ ID NO:30) and - 
28REV (5'- AAC AGO TAT GAC CAT G -3') (SEQ ID N0:31) primer bindmg sites were 
built on to the 5' end of each forward and reverse PGR primer, respectively, during synthesis. 
150 ng of genomic DNA was used in a 50 fil PGR with 2UAnq)liTaq, 500 nM primer and 
125 |iM dNTP. Buffer and cycling conditions were specific to each prima: set TaqStart 
antibody (Glontech) was used for hot start PGR to minimize primer dimer formation. 10% of 
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the product was exanuBed on an agarose gel. The appropriate samples were diluted 1 :25 with 

deionized water before sequencing. 

Each PGR product was sequenced according to the standard Energy Transfer primer 
(Amersham) protocol. All reactions took place in 96 well trays. 4 separate reactions, one 
5 eachforA,C.GandTwereperfohnedforeachteniplate. Each reaction included 2 ^il of the 
sequencing reaction mix and 3 jil of diluted template. ITie plates were then heat sealed with - 
foil tape and placed in a thermal cycler and cycled according to the manufacturer's 
recommendation. Aftercycling,the4reactions were pooled. 3 nl of the pooled product was 
transferred to a new 96 well plate and 1 jU of the manufecturer-s loading dye was added to 
10 each weU. All 96 well pipetting procedures occurred on a Hydra 96 pipetting station 

(Robbins Scientific, USA). 1 ^1 of pooled material was directly loaded onto a 48 lane gel 
nmning on an ABI 377 DNA sequencer for a 10 hour, 2.4 kV run. 

• Polyphred (University of Washington) was used to assemble sequence sets for 
viewing wilii Consed (University of Washington). Sequences were assembled in groups 
15 representingaBrelevantfannlymembersandcontrolsforaspecifiedtargetre^^^ -miswas 
done separately for each of the three stages. Forward-and reverse reads were included for 
eacb individual along with reads from the monochromosom^ templates and a color annotated 
reference sequence. Po/j^W indicated potential polymorphic sites with a purple flag. Two 
readers independentiy viewed each assembly and assessed the vaUdity of tiie purple-flagged 
20 sites. 

A total of 23 exons present in the matiire mRNA and several other portions of the 
primary transcript were evahiated for heterozygosity in the nuclear family of two HBM- 
affected and two unaffected individuals. Twenty-five single nucleotide polymorphisms 
(SMPs) were identified, as daown in the table below. 
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TABLE 4: Single Nucleotide Polymorphisms in the Zmaxl gene and Environs 





Exon Name 


Location 


Base Change 




b200e2 1 -h_Coiitigl_l .nt 


69169 (309G) 


C/A 




b200e2 1 -h_Contig4_12 mt 


27402 (309G) 


AJG 


5 


b200e2 1 -h_Contig4_13 Jit 


27841 (309G) 


T/C 




b200e2 1 -h._Contig4_16.nt 


35600 (309G) 


A/G 




b200e21-h_Contig4_21 .nt 


45619 (309G) 


G/A 




b200e2 1 -h_Contig4_22.nt-a 


46018 (309G) 


T/G 




b200e2 1 -h_Contig4_22.nt-b 


46093 (309G) 


T/G 


10 


b200e21-h_Contig4_22.nt-c 


46190 (309G) 


A/G 




b200e21-h_Contig4_24.iit-a 


50993 (309G) 


T/C 




b200e21-h_Contig4_24.nt-b 


51124 (309G) 


C/T 




b200e21-h_Contig4_25jit 


55461 (309G) 


C/T 




b200e2 1 -h_Contig4_33 .nt-a 


63645 (309G) 


C/A 


15 


b200e21-h_Contig4_33.nt-b 


63646 (309G) 


A/C 




b200e21-h_Contig4_6lJit 


24809 (309G) 


T/G 




b200e21-h_Contig4_62.nt 


27837 (309G) 


T/C 




b200e21-h._Contig4_63jit-a 


31485 (309G) 


C/T 




b200e2 1 -h_Contig4_63 .nt-b 


31683 (309G) 


A/G 


20- 


b200e21-h_Contig4_9.nt 


24808T309G) 


T/G 




b527dl2-h._Contig030g_l Jit-a 


31340 (3d8G) 


T/C 




b527dl2-h_Coiitig030g_l jit-b 


32538 (308G) 


A/G 




b527dl2-h_Contig080C_2jit 


13224 (308G) 


A/G 




b527dl2-li_Contig087C_lJit 


21119 (308G) 


C/A 


25 


b527dl2-h_Contig087C_4jit 


30497 (308G) 


G/A 




b527dl2-h_Contig088C_4jit 


24811 (309G) 


A/C 




b527dI2-h._Coiitig089_lHPj3t 


68280 (309G) 


G/A 
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in addition to the polymoiptosms presented m l adle 4, two additional 
polymorphisms can also be present in SEQ ID NO:2. These is a change at position 2002 of 
SEQ ID NO:2. Either a guanine or an adenine can appear at this position. This 
pclymoiphism is silent and is not associated with any change in the amino acid sequence. 
5 The second change is at position 4059 of SEQ ID NO:2 corresponding in a cytosine (C) to 
thymine (T) change. This polymorphism results in a corresponding amino acid change from 
a valine (V) to an alanine (A). Other polymorphisms were found in the candidate gene exons 
and adjacent intron sequences. Any one or combination of the polymorphisms Usted in Table 
4 or the two discussed above could also have a minor effect on bone mass or lipid levels 
10 whenpresentinSEQIDNO:2. 

The present invention encompasses the nucleic acid sequences having the nucleic acid 
sequence of SEQ ID NO: 1 with the above-identified point mutations. 

Preferably, the present invention encompasses the nucleic acid of SEQ ID NO: 2. 
Specifically, a base-pair substitution changing G to T at position 582 in the coding sequence 
15 of Zmaxl (the HBM gene) was identified as heterozygous in all HBM individuals, and not 
found in the unaffected individuals (i.e., b527dl2-h_Contig087C_l .nt). Fig. 5 shows the 
order of the contigs in B527D12. The direction of transcription for the HBM gene is from left 
to right The sequence of contig308G of B527D12 is the reverse complement of the coding 
region to the HBM gene. Therefore, the relative polymorphism in contig 308G shown in 
20 Table 4 as a base change substitution of C to A is the complement to the G to T substitution 
in the HBMgene. This mutation causes a substitution of glycine 171 with valine (G171V). 

The HBM polymorphism was confirmed by examining the DNA sequence of different 
groups of individuals. In all members of the HBM pedigree (38 individuals), tiie HBM 
polymorphism was observed in the heterozygous form in affected (i.e., elevated bone mass) 
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individuals only (N=18). In unaffected relatives (N==20) (BMDZ<2.0) the HBM 
polymoiphism was never observed. To determine whether this gene was ever observed in 
individuals outside of the HBM pedigree, 297 phenotyped individuals were characterized at 
the site of the HBM gene. None were heterozygous at the site of the HBM polymorphism. In 
5 an unphenotyped control group, 1 of 42 individuals was observed to be heterozygous at 
position 582. Since this individual is deceased, their bone mineral density could not be 
obtained. Taken together, these data prove that the polymoiphism observed in flie kindred 
displaying the hi^ bone mass phenotype is strongly correlated with the G->T polymorphism 
at position 582 of Zmaxl. Taken together, these results establish that the HBM 
10 polymorphism genetically segregates with the HBM phenotype, and that both the HBM 
polymorphism and phenolype are rare in the general population. 

Xn. Allele Specific Oligonucleotide (ASO) Analysis 

The amplicori containing the HBMl polymorphism was PGR amplified using primers 
specific for the exon of interest The appropriate population of individuals was PGR 

15 amplified in 96 well microliter plates as follows. PGR reactions (20 pil) containing IX 
Promega PGR buffer (Cat #M1883 containing 1.5 mMMgGlj), lOOmM dNTP, 200 nM 
PGR primers (1863F: CCAAGTTGTGAGAAGTGG and 1864R: AATACCTGAAAGGAT 
AGCTG), 1 U Amplitaq, and 20 ng of genomic DNA were prepared and amplified under the 
foUowing PGR conditions: 94^G, 1 minute, (94°G, 30 sec; 58°G, 30 sec.; 72°G, 1 min.) X35 

20 cycles), 72**G, 5', 4^G, hold Loading dye was then added and 10 pi of the products was 

electrophoresed on 1.5% agarose gels containing 1 jig/mi efhidium bromide at 100-150 V for 
5-10 minutes. Gels were treated 20 minutes in denaturing solution (1.5 M NaGl, 0.5 N 
NaOH), and rinsed briefly with water. Gels were then neutralized in 1 M Tris-HGl, pH 7.5, 
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1 .5 M NaCl, for 20 minutes and rinsed with water. Uels were soaked m 1 0 X iii^C tor 2U 
minutes and blotted onto nylon transfer membrane (Hybond N+- Amersham) in lOX SSC 
overnight. Filters were the rinsed in 6X SSC for 10 minutes and UV crosslinked. 

The aUele specific oUgonucleotides (ASO) were designed with the polymoiphism 
5 approximately in the middle. OUgonucleotides were phosphate free at the S'end and were 
purchased from Gibco BRL. Sequences of the oligonucleotides are: 
2326 Zmaxl .ASO.g: AGACTGGGeTGAGACGC 
2327ZmaxlASO.t: CAGACTGGGITGAGACGCC 
The polymorphic nucleotides are underlined. To label the oUgos, 1.5 jil of 1 p.g/p.1 ASO 
10 oUgo (2326.Zmax.l.ASO.g or 2327.Zmaxl.ASO.t), 11 jil ddHA 2 jil lOX kinase forward 
bujSer, 5 nl y-^-ATP (6000 Ci/mMole), and 1 jil T4 polynucleotide kinase (10 U/jil) were 
mixed, and the reaction incubated at 37-C for 30-60 minutes. Reactions were then placed at 
95°C for 2 minutes and 30 ml H^O was added. The probes were purified using a G25 
microspin column (Pharmacia). 
15 Blots were prehybridized in 10 ml 5X SSPE, SXDenhardt's, 2% SDS. and 100 ^g/ml, 

denatured, sonicated salmon sperm DNA at 40°C for 2 hr. The entire reaction mix of kinased 
oHgowas&enaddedtol0mlfreshhybridizationbu£Fer(5XSSPE.5XDenhardts.2%SD« 

hybridized at 40°C for at least 4 hours to overnight 

All washes done in 5X SSPE, 0.1 % SDS. The first wash was at 450C for 15 minutes; 
20 thesolutionwasthenchangedandthefilterswashedSO-CforlSminutes. Filters were then 
exposed to Kodak biomax film with 2 intensifying screens at -70°C for 1 5 minutes to 1 hr. If 
necessary the filters were washed at SS-'C for 15 minutes and exposed to fihn again. Filters 
were stripped by washing in boiling O.IX SSC, 0.1% SDS for 10 minutes at least 3 times. 
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The two films that best captured the allele specific assay with the 2 ASOs wore converted 
into digital images by scanning them into Adobe PhotoShop. These images were overlaid 
against each other in Graphic Converter and. then scored and stored in FileMaker Pro 4.0 (see 
Fig. 9). 

5 XfTL CeUuIar Localization of Zmaxl 

A. Gene Expression in Rat tibia by nan ispiopic In Situ Hybridization 

In situ hybridization was conducted by Pathology Associates Internationa] (PAX), 
Frederick, MD. This study was undertaken to determine the specific cell types that express 
the Zmaxl gene in rat bone with particular emphasis on areas of bone growth and remodeling. 
10 Zmaxl probes used in this study were geiierated fix^m both human (HuZmaxl) and mouse 
(MsZmaxl) cDNAs, which share an 87% sequence identity. The homology of human and 
mouse Zmaxl with rat Zmaxl is unknown. 

For example, gene expression by non-isotopic in situ hybridization was performed as 
follows, but other methods would be known to the skilled artisan. Tibias were collected fix)m 
15 two 6 to 8 week old female Sprague Dawley rats euthanized by carbon dioxide; asphyxiation. 
Distal ends were removed and proximal tibias were snap frozen in OCT embedding medium 
with liquid nitrogen immediately following death. Tissues were stored in a -80'C fireezer. 

Probes for amplifying PGR products firom cDNA were prepared as follows. The 
primers to amplify PGR products fi-om a cDNA clone were chosen using pubfished sequences 
. 20 of both human LRP5 (Genbank Accession No. AB017498) and mouse LRP5 (Genbank 
Accession No. AF064984). ha order to minimize cross reactivity with other genes m the 
LDL receptor family, the PGR products were derived firom an intracellular portion of the 
protein coding regioa PGR was performed in a 50 jil reaction volume using cDNA clone as 
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template. ^CK reactions contained 1.5 mM MgClj, 1 unit Amplitaq, 200 pM dMTJ^s and 2 
liM each primer. PGR cycling conditions were 94*0 for 1 min., foUowed by 35 cycles of 
94°C for 30 seconds, 55°C for 30 seconds, 12''C for 30 seconds; followed by a 5 minute 
extension at 72°C. Thereactions werethenrunona 1.5 % agarose Tris-Acetate geL DNA 
was eluted from the agarose, ethanol precipitated aad resuspended in 10 mM Tris, pH 8.0. 
Gel purified PGR products were prepared for both mouse and human cDNAs and suppHed to 
Pathology Associates International for in situ hybridizations. 

The sequence of the human and mouse PGR primers and products were as follows: 
HiiTTian Zma-icl sense pr imer fHBM1253'> 

CCCGTGTGCTCCGCCGCCCAGTTC 
TTwTnan Zmavl antisens e primer (HBM1465) 

GGCTCACGGAGCtCATCATGGACTT 
HiimanZmsyl PHR product 

CCCGT6TGCTCCGCCGCCCAGTTCCCCTGCGCGCGGGGTCAGTGTGTGGACCTGCGCCTGCGCTGCGACGGCGA6 

GCAGACTGTCAGGACCGCTaVQACGAGGTGGACTGTGACGCCATCTGCCTGCCCAACCA^ 

GGCCAGTGTGTCCTCATCa^CAGOiGTGCGACTCCTTCCCCGACTGTATCGACGGCT 

CSAAATCACCAAGCCGCCCTCAGACGACaGCCCGGCCCACAGCAGTGCCATOT^^ 

TCTCTCTTCGTCATGGGTGGTGTCTATTTTGTGTGCCAGCGCGTGGTQTC 

CCCTTCCCGO^CGACrrATGTCAGCGGGACCCCGCACGTGCCCCTCAATTTCATAGCCCCGGGCGGOT^^ 
GGCCCCTTCACAGGCATCGCATGCGGAAAGTCCATGATGAGCTCCGTGAGCC 

Mouse Zmayl Sense primer rHBM1655'> 

AGCGAGGCCACCATCCACAGG 

Mf)i]i=!e Zmavl antisens e primer fflBM1656^ 

TCGCTGGTCGGCATAATCAAT 
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Mouse Zma xl PGR Product 

AGCAa^lGCCACCATCCACAGGATCTCCCTGGAGACTAACAAC^^ 
Gl^CCTCTGCACTGGACTTTGATGTGTCCAACAATCAC^^ 

CGAGCCTTCATGAATGGGAGCTCAGTGGAGCACGTGATTGAGTTTGGCCTCGACTACCCT 
5 GACTGGATGGGCAAGAACCTCTATTGGGCGGACACAGGGACCAACAGGATTGAGGTGGCCCGGCTGGATGGGC^ 
TTCCGGCAGGTGCTTGTGTGGAGAGACCTTGACAACCCCAGGTCTCrrGGCrCT 
TACTGGACTGAGTGGGGTGGCAA.GCCAAGGATTGTGCGGGCCTTCATGGATGG<^ 
GACAAGGTGGGCCGGGCCAACGACCTCACCATTGATTATGCCGACCAGCGA 

Riboprobes were synthesized as follows. The PGR products were reamplified with 

10 chimeric primers designed to incorporate either a T3 promoter upstream, or a T7 promoter 
downstream of the reampUfication products. The resulting PGR products were used as 
template to synthesize digoxigenin-labeled riboprobes by in vitro transcription (WT). 
Antisense and sense riboprobes were synthesized using T7 and T3 RNA polymerases, 
respectively, in the presence of digoxigenin-1 1-UTP (Boehringer-Mannheim) using a 

15 MAXIscript IVT kit (Ambion) according to the manufacturer. The DNA was then degraded 
with Dnase-1, and unincorporated digoxigenin was removed by ultrafiltration- Riboprobe 
integrity was assessed by electrophoresis through a denaturing polyacrylamide gel. 
Molecular size was compared with the electrophoretic mobility of a 100-100.0 base pair (bp) 
RNA ladder (Ambion). Probe yield and labeling was evaluated by blot immunochemistry. 

20 Riboprobes were stored in 5 fil aliquots at -80°Q 

The in situ hybridization was perfomied as follows. Frozen rat bone was cut into 5 
|xM sections on a Jung CM3000 ayostat (Leica) and mounted on adhesive shdes 
(histrumedics). Sections were kept in the cryostat at -20"C until all the slides were prepared 
in order to prevent mRNA degradation prior to post-fixation for 15 minutes in 4% 

25 paraformaldehyde. FoDowing post-fixation, sections were incubated with 1 ng/p.1 of either 
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antisense or sense riboprobe in PatHology Associates Jntemational (I'M) customized 
hybridization buffer for approximately 40 hours at 58'C. FoUowing hybridization, slides 
were subjected to a series of post-hybridization stringency washes to reduce nonspecific 
probe binding. Hybridization was visualized by immunohistochemistry with an anti- 
5 digoxigenin antibody (FAB fragment) conjugated to alkaline phosphatase. Nitroblue 

tetrazoUum chloride/bromochloroindolyl phosphate (Boehiinger-Maimheim), a precipitating 
alkaline phosphatase substrate, was used as the chromogen to stain hybridizing cells purple to 
nearly black, depending on the degree of staining. Tissue sections were counter-stained with 
nuclear fast red. Assay controls included omission of the probe, omission of probe and anti- 
10 digoxigenin antibody. 

Specific ceU types were assessed for demonstration of hybridization with antisense 
probes by visualizing a purple to black cytoplasmic and/or peri-nuclear staining indicating a 
positive hybridization signal for mRNA. Each cell type was compared to the xepHcate 
sections, wHch were hybridized with the respective sense probe. Results were considered 
15 positive if staining was observed with the antisense probe and no staining or weak 
backgroTmd with the sense probe. 

The cellular localization of the hybridiz^on signaljcor each of the study probes is 
summarized in Table 5. Hybridization for Zmaxl was primarily detected in areas of bone 
involved in remodeling, including the endosteum and trabecular bone within the metaphysis. 
20 Hybridization m selected bone lining cells of the periosteum and epiphysis were also 
observed. Positive signal was also noted in chondrocytes within the growth plate, 
particularly in the proliferating chondrocytes. See Figs. 10, 11 and 12 for representative 
photomicrographs of zn situ hybridization results. 
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TABLES 

Sunima37 of Zmaxl in situ hybridization in rat tibia 



Probe 


Site 


ISH SIGNAL 


HuZmaxl 


Epiphysis 






Osteoblasts 






Osteoclasts 






Growth Plate 


• 




resting chondrocytes 


- 




proliferating chondrocytes 






hypertrophic chondroc3^es 


- 




Metanhvsis 






osteoblasts 






osteoclasts 


+ 




Diaphysis 


- 




Endostexrm 






osteoblasts 






osteoclasts 






Periosteum 


- 














MsZmaxl 


Epiphysis 






Osteoblasts 






Osteoclasts 


- 




Growth Plate 






resting chondrocytes 






proliferating chondrocytes ■ ^ 


+ 




hypertrophic chondrocjrtes 






Metanhvsis 






osteoblasts 


+ 




osteoclasts 






Diaphvsis 






Endosteum 






osteoblasts 






osteoclasts 






Periosteum 


+ 









Legend: = hybridization signal detected = no hybridization signal detected 
*TSir - In situ hybridization 
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These studies confirm the positional expression of ZmaxVin cells involved in bone 
remodeling and bone formation. Zmaxl expression in the zone of proUferation and in the 
osteoblasts and osteoclasts of the proximal metaphysis, suggests that the Zmaxl gene is 
involved m the process of bone growth and mineralization. The activity and differentiation of 

5 osteoblasts and osteoclasts are closely coordinated dimng development as bone is formed and 
daiiing growth as well as in adult life as bone undergoes continuous remodeling. The 
fonnation of internal bone stmctures aadbone remodehng result from the coupling of bone 
resoiptioh by activated osteoclasts with subsequent deposition of new material by osteoblasts. 
Zmaxl is related to the LDL receptor gene, and thus may be a receptor involved in 

1.0 mechanosensation and subsequent signaling in the process of bone remodehng. Therefore, 
changes in the level of expression of this gene could impact on the rate of remodeling and 
degree of minerahzation of bone. Similar studies can be designed for in situ analysis of 
HBM or Zmaxl in otiier cells or tissues. 

XrV. Antisense 

15 Antisense oUgonucleotides are short synthetic nucleic acids that contain 

complementary base sequences to a targeted RNA Hybridization of the RNA in Jiving cells 
with the antisense oUgonucleotide interferes with RNA function and ultimately blocks protein 
expression. Therefore, any gene for which the partial sequence is known can be targeted by 
an antisense oUgonucleotide. 

20 Antisense technology is becoming a widely used research tool and will play an 

increasingly important role in the validation and elucidation of ther^eutic targets identified 
by genomic sequencing efforts. 
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Antisense tecimology was developed to mhibit gene expression by ublizmg an 
oligonucleotide complementary to the mRNA that encodes the target gene. There are several 
possible mechanisms for the inhibitory effects of antisense ohgonucleotides. Among them, 
degradation of mKNA by RNase H is considered to be the major mechanism of inhibition of 
protein function. This technique was originaUy used to elucidate the function of a target 
gene, but may also have therapeutic ^pUcations, provided it is designed carefully and 
properly. 

An example of materials and methods for preparing antisense oKgonucIeotides can be 
performed as follows. Preliminary studies have been undertaken m collaboration with 
Sequiter (Natick, MA) using the antisense technology m the osteoblast-like murine cell line, 
MC3T3. These cells can be triggered to develop along the bone differentiation sequence. An 
initial proliferation period is characterized by minimal expression of differentiation markers 
and initial synthesis of collagenous extraceUular matrix. CoUagen matrix synthesis is 
required for subsequent mduction of differentiation markers. Once the matrix synthesis 
begins, osteoblast marker genes are activated in a clear temporal sequence: alkaline 
phosphatase is induced at early times while bone sialoprotien and osteocalcin appear later m 
the differaatiation process. This temporal sequence of gene e:q)ression is useful in 
monitoring the maturation ^d mineralization process. Matrix mineralization, which does not 
begin until several days after maturation has started, involves deposition of mineral on and 
within coUagen fibrils deep within the matrix near the ceU layer-culture plate interface. The 
collagen fibril-associated mineral formed by cultured osteoblasts resembles that found in 
woven bone in vivo and therefore is used frequently as a study reagent 

MC3T3 cells were transfected with antisense ohgonucleotides for the first week of the. 
differentiation, according to the manufecturer's specifications (U.S. Patent No. 5,849,902). 
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lb& oJagonucleotides designed tor Zmaxl are given below: 
10875; AGUACAGCXJUCUUGCCAACCCAGUC 
10876: UCCXJCCAGGUCGAUGGUCAGCCCAU 
10877: GUCUGAGUCCGAGUUCAAAUCCAGG 
5 Figure 13 shows the results of antisense inhibition of Zmaxl in MC3T3 cells. The three 

oUgonucleotides shown above were transfected into MC3T3 and KNA was isolated according 
to standard procedures. Northern analysis clearly shows markedly lower steady state levels 
of the Zmaxl transcript while the control gene GAPDH remained unchanged. Thus, 
antisense technology using the primers described above aUows for the study of the role of 
10 Zmaxl expression on bone biology. Sinnlar primers can be used to study Zmaxl expression 
and its ability to regulate lipid levels in an animal. 

The protein encoded by Zmaxl is related to the Low Density Lipoprotein receptor 
(LDL receptor). See, Goldstein et aL, Ann. Rev. CellBiology, 1:1-39 (1985); Brown a/.. 
Science, 232:34-47 (1986). The LDL receptor is responsible for uptake of low density 
15 hpoprotein. a lipid-protein aggregate that includes cholesteroL Individuals with a defect in 
the LDL receptor are deficient in cholesterol removal and tend to develop artherosclerosis. In 
addition. ceUs with a defective LDL receptor show increased production of cholesterol, in 
part because of altered feedback regulation of cholesterol synflietic enzymes and in part 
because of increased transcription of die genes for these enzymes. In some ceU types, 
20 cholesterol is a precursor for die formation of steroid homiones. 

Thus, the LDL receptor may, directly or indirectly, function as a signal transduction 
protem and may regulate gene expression. Because Zmaxl is related to the LDL receptor, 
tiiis protein may also be involved in signaling between cells in a way tiiat affects bone 
remodeling as well as regulate Upid levels and therefore Upid-mediated diseases. 

V 
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rue giycme 17 1 ammo acid is likely to be important for the fimction ofZmaxl " 
because tbis amino acid is also found in the mouse homologue of Zmaxl . The closely related 
LRP6 protein also contains glycine at the conresponding position (Brown al. Biochemical 
and Biophysical Research Comm., 248:879-888 (1988)). Amino acids that are important in a 
5 protein's structure or function tend to be conserved between species, because natural selection 
prevents mutations with altered amino acids at important positions from arising. 

In addition, the extraceUular domain of Zmaxl contains four repeats consisting of five 
YWT motife followed by an EFG motif. This SYWT+EGF repeat is likely to fomi a distinct 
folded piotem domain, as this repeat is also found in the LDL receptor and other LDL 
10 receptor-related proteins. The first three 5YWT+EGF repeats are very similar in their 

structure, while the fourth is highly divergent Glycine 171 occurs in the central YWT motif 
of the first 5YWT+EGF repeat in Zmaxl. IHe other two similar SYWT+EGF repeats of 
Zmaxl also contain glycine at the coiresponding position, as does the 5YWT+EGF repeat in 
the LDL receptor protein. However, only 17.6% of the amino acids are identical among the 
15 first three 5YWT+EGF repeats in Zmaxl and the single repeat in the LDL receptor. ITiese 
observations indicate that glycine 171 is essential to the function of this repeat, and mutation 
of glycine 171 causes a functional alteration of Zmaxl. Ite cDNA and peptide sequences are 
shown in Figs, 6A-6E. The critical base at nucleotide position 582 is indicated in bold and is 
underlined. 

20 Northern blot analysis (Figs. TA-B) rev^s that Zmaxl is expressed in humaa bone 

tissue as weH as nmnemus other tissues. A multiple-tissue Northern blot (Clontech, Palo 
Alto, CA) was probed with exons firom Zmaxl. As shown in Fig. 7A, the 5.5 kb Zmaxl 
transcript was hi^y expressed in heart, kidney, lung, liver and pancreas and is expressed at . 
lower levels in skeletal muscle and brain. A second northern blot, shown in Fig. 7B, 
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confirmed the transcript size at 5.5 kb, and indicated that Zmaxl is expressed in bone, bone 
marrow, calvaria and human osteoblastic cell lines. 

Taken together, these results indicate that the HBM polymorphism in the Zmaxl gene 
is responsible for the HBM phenotype, and that the Zmaxl gene is important m bone 
5 development In addition, because mutation of Zmaxl can alter bone mineralization and 
development as well as lipid levels, it is likely that molecules that bind to Zmaxl may 
usefully alter bone development and Hpid levels. Such molecules may include, for example, 
small molecules, proteins, RNA aptamers, peptide aptamers, and the like. 

XV. Preparation of Nucleic Acids, Vectors, Transformations and Host Cells 
10 ■ Large amounts, of the nucleic acids of the present invention may be produced by 

repUcation in a suitable host cell. Natural or synthetic nucleic acid fragments coding for a 
desired fragment wiH be incorporated into recombinant nucleic acid constructs, usually DKA 
constrocts, enable of introduction into and replication in aprokaiyotic or eukaryotic cell. 
Usually the nucleic acid constructs will be suitable for repUcation in a uniceUular host, such 
15 as yeast or bacteria, but may also be intended for introduction to (with and without 

integration within the genome) cultured mammalian or plant or other eukaryotic cell lines. 
The purification of nucleic acids produced by the methods of the present invention is 
described, for example, in Sambrook et al. Molecular Cloning. A Laboratory Manual, 2nd 
Ed. (Cold Spring Harbor Laboratory, Cold Spring Harbor, NY (1989) or Ausubel et al, 
20 Current Protocols in Molecular Biology, J. Wiley and Sons, NY (1992). 

The nucleic acids of the present invention may also.be produced by chemical 
synthesis. e.g., by the phosphoramidite method described by Beancage et al., Tetra. Utts., 
22:1859-1862 (1981) or the triester mefliod according to Matteucci, et al, J. Am. Chem. Soc, 
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103:3185 (1981), and may be performed on commercial, automated oligonucleotide 
synthesizers. A double-stranded fragment may be obtained from the single-stranded product 
of chemical synthesis either by synthesizing the complementary strand and annealing the 
strands together under appropriate conditions or by adding the complementary strand using 
DNA polymerase with an appropriate primer sequence. 

Nucleic acid constructs prepared for introduction into a prokaiyotic or eukaryotic host 
may comprise a repHcation system recognized by the host, including the intended nucleic 
acid fragment encoding the desired protein, and will preferably also include transcription and 
translational initiation regulatory sequences operably linked to the protein encoding segment. 
Expression vectors may mclude, for example, an origin of replication or autonomously 
repUcating sequence (ARS) and expression control sequences, a promoter, an enhancer and 
necessary processing information sites, such as ribosome-binding sites, UNA splice sites, 
polyadaiylation sites, transcriptional teaminator sequences, and mRNA stabilizing sequences. 
Secretion signals may also be included where appropriate, whether from a native HBM or 
Zmaxl protein or from o&er receptors or from secreted proteins of the same or related 
species, which allow the protein to cross and/or lodge in cell manbranes, and thus attain its 
functional topology, or be secreted from the ceH. Such vectors may be prepared by means of 
standard recombinant techniques well known in the art and discussed, for example, in 
Sambrook et al.. Molecular Cloning. A Laboratory Manual 2nd Ed. (Cold Spring Harbor 
Laboratory, Cold Spring Harbor, NY (1989) or Ausubel et al. Current Protocols in 
Molecular Biology, J. Wiley and Sons, NY (1992). 

An appropriate promoter and other necessary vector sequences will be selected so as 
to be functional in Ihe host, and may iaclude, when ^propriate, those naturally associated . 
with Zmaxl or flBM genes. Examples of workable combinations of cell lines and expression 
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vectors are described in Sambrook et al. Molecular Ooning. A Laboratory Manual, 2nd iid. 
(Cold Spring Harbor Laboratory, Cold Spring Harbor, NY (1989) or Ausubel et al.. Current 
Protocols in Molecular Biology, J. Wiley and Sons, NY (1992). Many useful vectors are 
known in the art and may be obtained from such vendors as Stratagene, New England 
5 BioLabs, Promega Biotech, and others. Promoters such as the tip. lac and phage promoters, 
tRNA promoters and glycolytic enzyme promoters may be used in prokaryotic hosts. Useful 
yeast promoters include promoter regions for metallothionein, 3-phosphoglycerate kinase or 
other glycolytic enzymes such as enolase or glyceraldehyde-3-phosphate dehydrogenase, 
enzymes responsible for maltose and galactose utilization, and others. Vectors and promoters 
10 . suitable foruse in yeast expression are furtherdescribedinEP73,675A. Appropriate non- 
native mammaHan promoters might include the early and late promoters from SV40 (Fiers et 
al. Nature, 273:1 13 (1978)) or promoters derived from murine Moloney leukemia virus, 
mouse tumor virus, avian sarcoma viruses, adenovirus n, bovine papilloma virus or polyoma, 
m addition, the construct may be joined to an amplifiable gene (e.g., DHFR) so that multiple 
15 copies of the gene may be made. For appropriate enhancer and other expression control 
sequences, see also Enhancers and Eukaryotic Gene Expression, Cold Spring Haibor Press, 
Cold Spring Haibor, NY (1983). 

mile such expression vectors may repUcate autonomously, they may also replicate 
by being inserted into live genome of the host cell, by methods well known in the art. 
20 Expression and cloning vectors will likely contain a selectable marker, a gene 

encoding a protein necessary for survival or growth of a host cell transformed with the vector. 
THe presence of this gene ensures growth of only those host cells which express the inserts. 
Typical selection genes encode proteins that a) confer resistance to antibiotics or other toxic 
substances, e.g. ampicillin, neomycin, methotrexate, etc.; b) complement auxotrophic 
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aenciencies, or c; suppiy cnncai numerns noi avaiiaoie nrom compiex meoia, e.g., me gene 
encoding D-alanine racemase for Bacilli. The choice of the proper selectable marker will 
dqjend on the host cell, and appropriate markers for different hosts are weU known in the art. 

The vectors contabiing the nucleic acids of interest can be transcribed in vitro, and tiie 
resulting RNA introduced into the host cell by well-known methods, e.g., by injectiou (see, 
Kubo et al, FEBS Letts. 241:119 (1988)), or the vector can be introduced directly into host 
cells by methods well known in the art, which vaiy depending on the type of cellular host, 
including electroporation; transfection employing calcium chloride, rubidium chloride, 
calcium phosphate, DEAE-dextran, or other substances; microprojectile bombardment; 
Upofection; infection (where the vector is an infectious agent, such as a retroviral genome); 
and other methods. See generaUy, Sambrook et al, 1989 and Ausubel et al, 1992. The 
introduction of the nucleic acids into the host cell by any method known m the art. includmg 
those described above, will be refeired to herein as "transformation." The cells into which 
have been introduced nucleic acids described above are meant to also include the progeny of 
such cells. 

Large quantities of the nucleic acids and proteins of the present invention may be 
prepared by expressing the Zmaxl or HBM nucleic acids or portions thereof in vectors or 
other expression vehicles in compatible prokaiyotic or eukaryotic host cells. The. most 
commonly used prokaryotic hosts are strains of Escherichia coli, although other prokaryotes, 
sudi as Bacillus subtilis or Pseudomonas may also be used. 

Mammalian or other eukaryotic host cells, such as those of yeast, filamentous fimgi, 
plant, insect, or amphibian or avian species, may also be useful for production of the proteins 
of the present invention. Propagation of mammalian cells in culture is per se well known. . 
See, Jakoby and Pastan (eds.). Cell Culture. Methods in Enzymology, volume 58, Academic 
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Press, Inc., Harcoijrt Brace Jovanovich, ^fY, (1979)). Examples of commonly used 
mammalian host cell lines are VERO and HeLa cells, Chinese hamster ovary (CHO) cells, 
and WD8, BHK, and COS ceU lines, although it will be appreciated by the skilled 
practitioner that other cell lines may be appropriate, e.g., to provide higher expression 

5 desirable glycosylation patterns, or other features. 

Clones are selected by using markers depending on the mode of the vector 
construction. The marker may be on the same or a different DNA molecule, preferably the 
same DNA molecule. In prokaiyotic hosts, the transfonnant may be selected, e.g., by 
resistance to ampicilhn, tetracycline or other antibiotics. Production of a particular product 

10 based on temperature sensitivity may also serve as an appropriate markrar. 

Prokaiyotic or eukaryotic cells transformed with the nucleic acids of the present 
invention wUl be useful not only for the production of the nucleic acids and proteins of the 
present invention, but also, for example, in studying the characteristics of Zmaxl orHBM 
proteins. 

15 Antisense nucleic add sequences arc usefiil in preventing or diminishing the 

expression of Zmaxl or HBM, as wiU be appreciated by one skilled in the art. For example, 
nucleic acid vectors containing all or aportion of die Zmaxl or HBM gene or other sequences 
firom the Zmaxl or HBM region may be placed under the control of a promoter in an 
antisense orientation and introduced into a ceU. E:q)ression of such an antisense construct 

20 within a cell will interfere with Zmaxl or HBM transcription and/or translation and/or 
rephcation. 

The probes and primers based on the Zmaxl and HBM gene sequences disclosed 
herein are used to identify homologous Zmaxl and HBM gene sequences and proteins in 
other species. These Zmaxl and HBM gene sequences and proteins are used in the 
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• diagnostic/prognostic, therapeutic and drug screening methods described herein for the 
species from which they have been isolated. 

XVI. Protein Expression and Purification 

Expression and purification of the HBM protein of flie invention can be performed 
5 essentially as outlined below. To facilitate the cloning, e^qpression and purification of 
membrane and secreted protein from the HBM gene, a gene expression system, such as the 
pET System (Novagen), for cloning and expression of recombinant proteins in£. coliwas 
selected. Also, aDNA sequence encoding a peptide tag, the His-Tap, was fiised to the 3* end 
of DNA sequences of interest to facilitate purification of the recombinant protein products. 

10 The 3' end was selected for fiision to avoid alteration of any 5' terminal signal sequence. 

Nucleic acids chosen, for example, from the nucleic acids set forth in SEQ ID NOS: 
1, 3 and 5-12 for cloning HBM were prepared by polymerase chain reaction fPCR). 
Synflietic oligonucleotide primers specific for the 5" and 3' ends of the HBM nucleotide 
sequence were designed and purchased from Life Technologies (Gaithersburg, MD). All 

15 forward primers (specific for the 5' end of the sequence) were designed to include an Ncol 
cloning site at the 5* terminus. These primers were designed to pranit initiation of protein 
translation at the methionine residue encoded within the Ncol site followed by a valine 
residue and the proteui encoded by the HBM DNA sequence. All reverse primers (specific 
for the 3' end of the sequence) included an EcoRI site at the 5' temiinus to permit cloning of 

20 the HBM sequence into the reading firame of the pET-28b. The pET-28b vector provided a 
sequence encoding an additional 20 carboxyl-tenninal amino acids including six histidine 
residues (at the C-temiinus), which comprised the histidine affinity tag. 
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uenonnc DMA pr^area rrom tne MJSM gene was used as the source ot template ujn A 
for PGR ampUfication (Ausubel et al. Current Protocols in Molecular Biology, John Wiley 
& Sons (1994)). To amplify a DNA sequence containing the HBM nucleotide sequence, 
genomic DNA (50 ng) was introduced into a reaction vial containing 2 mM MgClj, 1 pM 
5 synthetic oligonucleotide primers (forward and reverse primers) complementary to and 
flanking a defined HBM, 0.2 mM of each of deoxynucleotide triphosphate, dATP, dGTP, 
dCTP, dTIT and 2.5 units of heat stable DNA polymerase (Amplitaq, Roche Molecular 
Systems, Inc., Branchburg, NJ) ia a final volume of 100 \il 

Upon completion of theraial cyclmg reactions, each sample of amplified DNA was 
10 purified using the Qiaquick Spin PGR purification kit (Qiagen, Gaithersburg, MD), All 
amplified DNA samples were subjected to digestion with the restriction endonucleases, e.g., 
Ncol and EcoRI (New England BioLabs, Beverly. MA) (Ausubel et al. Current Protocols in 
Molecular Biology, John Wiley & Sons, Inc. (1 994)). DNA samples were flien subjected to 
electrophoresis on 1.0% NuSeive (FMC BioProducts, Rockland, ME) agarose gels. DNA 
15 was visualized by exposure to ethidium bromide and long wave UV irradiation. DNA 

contaiaed in sUces isolated firom the agarose gel was purified using the Bio 101 GeneQean 
Kit protocol (Bio 101, Vista, CA). 

The pET-28b vector was prepared for cloning by digestion with restriction 
endonucleases, e.g., Ncol and EcoRI (New England BioLabs, Beverly, MA) (Ausubel et al, 
20 Current Protocols in Molecular Biology, John Wiley & Sons, Inc. (1994)). The pET-28a 
vector, which encodes the histidine affinity tag that can be fiosed to the 5' end of an inserted 
gene, was prepared by digestion with appropriate restriction endonucleases. 

Following digestion, DNA inserts were cloned (Ausubel et al.. Current Protocols in 
Molecular Biology, John Wiley & Sons, Inc. (1994)) into the previously digested pET-28b 
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expression vector. Products of the ligation reaction were then used to transform the BL21 
strain of coli (Ausubel et ah. Current Protocols in Molecular Biology, John Wiley & Sons, 
Inc. (1 994)) as described below. 

Competent bacteria, E. coli strain BL21 or E. coli strain BL21 (DE3), were 
5 transfonned with recombinant pET expression plasmids carrying the cloned HBM sequence 
according to standard methods (Ausubel et al. Current Protocols in Molecular Biology, John 
Wiley & Sons, hic. (1994)). Briefly, 1 fil of Hgation reaction was mixed with 50 nl of 
electrocompetent cells and subjected to a high voltage pulse, after which samples were 
incubated in 0.45 ml SOC medium (0.5% yeast extract, 2.0% tryptone, 10 mM NaCl, 2.5 mM 
10 KCl, 1 0 mM MgCla, 1 0 mM MgS04 and 20 mM glucose) at ST'C with shaking for 1 hour. 
Samples were then spread on LB agar plates containmg 25 ^g/ml kanamycin sulfate for 
growth overnight. Transformed colonies of BL21 were then picked and analyzed to evaluate 
cloned inserts, as described below. 

Individual BL21 clones transformed with recombinant pET-28b HBM nucleotide 
15 sequences were analyzed by PGR amplification of the cloned inserts using the same forward 
and reverse primers specific for the HBM sequences that were used in the original PGR 
amplification cloning reactions. Successful amplification verifies the integration of the HBM 
sequence in the expression vector (Ausubel et al.. Current Protocols in Molecular Biology, 
John Wiley & Sons, Inc. (1994)). 
20 Individual clones of recombinant pET-28b vectors carrying propa-ly cloned HBM 

nucleotide sequences were picked and incubated m 5 ml of LB broth plus 25 jig/ml 
kanamycin sulfate overnight The following day plasmid DNA was isolated and purified 
using file Qiagen plasmid puiification protocol (Qiagen Inc., Chatsworfh, CA). 
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The pET vector can be propagated in any E. coli K-12 strain, e.g., HMS174, HBlOl, 
JM109, DH5 and the like, for purposes of cloning or plasmid preparation. Hosts for 
expression include E. coli strains containing a chromosomal copy of the gene for T7 RNA. 
polymerase. These hosts were lysogens of bacteriophage DE3, a lambda derivative that 
5 carries the lad gene, the lacUV5 promoter and the gene for T7 RNA polymerase. T7 RNA 
polymerase was induced by addition of isopropyl-p-D-thiogalactoside (IPTG), and the T7 
RNA polymerase transcribes any target plasmid containing a functional T7 promoter, such as 
pET-28b, carrying its gene of interest. Strains include, for example. BL21(DE3) (Studier.er 
fl/..Afe/A. jEtizvwo/., 185:60-89 (1990)). 
10 To express the recombinant HBM sequence, 50 ng of plasmid DNA are isolated as 

described above to transform competent BL21(DE3) bacteria as described above (provided 
by Novagen as part of the pET expression Mt). The lacZ gene (P-galactosidase) is expressed 
in the pET-System as described for the HBM recombinant constructions. Transfonned cells 
were cultured in SOC medium for 1 hour, and the culture was then plated on LB plates 
15 containing 25 jig/ml kanamycin sulfate. The following day, the bacterial colonies were 
pooled and grown in LB medium containing kanamycin sulfate (25 jig/ml) to an optical 
density at 600 nM of 0.5 to 1.0 O.D. units, at which point 1 mM IPTG was added to the 
culture for 3 hours to induce gene expression of the HBM recombinant DNA constructions. 
After induction of gene expression with IPTG, bacteria were coUected by 
20 centrifugation m a Sorvall RC-3B centrifuge at 3500 x g for 15 minutes at 4°C. Pellets were 
resuspended in 50 ml of cold mM Tris-Ha pH 8.0,- 0.1 M NaCl and 0.1 mM EDTA (STE 
buffer). Cells were then centrifiiged at 2000 x g for 20 minutes at 4°C. Wet pellets were 
weighed and frozen at -80°C untU ready for protein purification. 
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A variety of methodologies known in the art can be used to purify the isolated 
proteins (Coligan et al. Current Protocols in Protein Science, John Wiley & Sons (1995)). 
For example, the frozen cells can be thawed, resuspended in buffer and ruptured by several 
passages through a small volume microfluidizer (Model M-llOS, Microfluidics International 
5 Corp., Newton, MA). The resultant homogenate is centrifiiged to yield a clear supernatant 
(crude extract) and, following filtration, the crude extract is firactioned over columns. 
Fractions are monitored by absorbance at OD2S0 mn and peak fractions may be analyzed by 
SDS-PAGE. 

Tlie concentrations of purified protein preparations are quantified 
10 spectrophotometrically using absorbance coefficients calculated from amino acid content 
(Perkins, Ei4r. J. Biochem,, 157:169-180 (1986)). Protein concentrations are also measured 
by the method of Bradford, JnaL Biochem., 72:248-254 (1976) and Lowry et al, J. Biol 
Chenu , 1 93 :265-275 (1 95 1) using bovine serum albxmnn as a standard. 

SDS-polyacrylamide gels of various concentrations were purchased firom BioRad 
15 (Hercules, CA), and stained with Coomassie blue. Molecular weight markers may include 
rabbit skeletal miiscle myosin (200 kDa), coli P-galactosidase (1 16 kDa), rabbit muscle 
phosphoiylase B (97.4 kDa), bovine serum albumin (66.2 kDa), ovalbumin (45 kDa), bovine 
carbonic anyhdrase (31 kDa), soybean trypsin hihibitor (2L5 kDa), egg white lysozyme (14.4 
kDa) and bovine aprotinin (6.5 kDa). 
20 Once a sufficient quantity of the desired protein has been obtained, it may be used for 

various purposes. A typical use is the production of antibodies specific for binding. These 
antibodies may be either polyclonal or monoclonal, and may be produced by in vitro or in 
vivo techniques well known m the art Monoclonal antibodies to epitopes of any of the 
peptides identified and isolated as described can be prepared fiiom murine hybridomas 
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(Kohler, Nature, 256:495 (1975)). In smmnary. a mouse is inoculated with a few micrograms 
ofHBM protein over a peiiod of two weelcs. The mouse is then sacrificed. The ceDs that 
produce antibodies are then removed from the mouse's spleen. The spleen cells are then 
fused with polyethylene glycol with mouse myeloma cells. The successfully fused cells are 
5 dilutedinamicrotiterplateandgrowthofthecultureiscontinued. The amount of antibody 
per well is measured by immunoassay methods such as ELISA (Engvall, Meth. Eiizymol, 
70:419 (1980)). Clones producing antibody can be expanded and further propagated to 
produce HBM antibodies. Other suitable techniques involve in vitro exposure of 
lymphocytes to the antigenic polypeptides, or alternatively, to selection of Hbraries of 
10 antibodiesinphageorsimilarvectors. See Huse a/., 5cze«c«. 246:1275-1281 (1989). For 
additional information on antibody production see Davis et al, Basic Methods in Molecular 
Biology, Elsevier, NY, Section 21-2 (1989). 

Standard protocols for assessing the influence of an agent {e.g., antibody, HBM 
protein, protein polymorphism or Zmaxl protein or compound) to alter lipid levels in a ceU or 
15 the physiological levels in a subject are known. For example. seeF.W. HEMMING. LIPID 
■ ANALYSIS (Bios Scientific Pub. 1996) and J. M. Ordovas, Lipoprotein Protocols 
(Humana Press mc. 1997). More specifically, cholesterol a^d triglyceride analysis can be 
performed using the Olympus AU5000 Cholesterol method. Tins method of measuring 
cholesterol combines the use of the enzymes with a modification of the peroxidase-pheol-4- 
20 aminoantipyrine system, substituting 2.hydroxy-3,5-dichlorobenzene sulfonic acid (2-OH 3,5 
DCBSA) for the phenohc group for the measurement of total cholesterol in the subject serum. 
The assay is based on a series of coupled enzymatic reactions. Cholesterol esters present in 
serum are hydrolyzed to free cholesterol and fetty acids by cholesterol esterase. The 
cholesterol is in turn oxidized by cholesterol oxidase to cholest-4.en-3-one wifli the 
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simuiianeous proaucnon oi Jiydxogen peroxiaase. i Jie hydrogen peroxidase reacts ^th 4- 
aminoantipyiine in the presence of 2-OH-3,5-DCBSA to produce a chromophore that absorbs 
at 570 nin. The absorbance of the reaction mixture is measured biochromaticaUy at 570/750 
nm and is proportional to the cholesterol concentration of the sample. 
5 For serum triglyceride analysis, the 01ynq)us AU5000 triglyceride procedure can also 

be used. Briefly, it is based on a series of coupled enzymatic reactions. Triglycerides in the 
serum are hydrolyzed to free fatty acids and glycerol by lipoprotein Hpase. Glycerol is 
phosphorylated enzymaticaUy and then oxidized with glycerol phosphate oxidase. TTie 

hydrogen peroxidase reacts with the chromogen 4-amino-antipyrine in the presence of DCB 
10 Sulfonic Acid to give a chromophore with absorption which is measured bichromatically at 
520/660 mn. The increase in absorijance of the reaction mixture is proportional to the 
triglyceride concentration of the sample. 

XVn. Methods of Use: Gene Therapy 

In recent years, significant technological advances have been made in the area of gene 
15 therapy for both genetic and acquired diseases. (Kay et oL, Proc Natl Acad, ScL USA, 
94: 12744-12746 (1997)) Gene ther^y can be defined as &e dehberate transfer of DNA for 
therapeutic purposes, hnprovement in gene transfer methods has allowed for development of 
gene therapy protocols for the treatment of diverse types of diseases. Gene ther^y has also 
taken advantage of recent advances in the identification of new therapeutic genes, 
20 improvement in both viral and nonviral gene delivery systems, better understanding of gene 
regulation, and improvement in cell isolation and transplantation. 

The experiments below identify the HBM gene as a dominant mutation confening 
elevated bone mass and that alters lipid levels. The fact that this mutation is dominant 
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indicates ttiat expression of the HBM protein causes elevated bone mass and perhaps changes 
in hpid levels. Older individuals carrying the HBM gene, and, therefore expressing the HBM 
protein, do not suffer from osteoporosis. These individuals are equivalent to individuals 
being treated with the HBM protein. These observations are a strong experimental indication 
5 that therapeutic treatment with the HBM protein prevents osteoporosis. The bone mass • 
elevating activity of the HBM gene is tenned "HBM function." 

Therefore, according to the present invention, a method is also provided of supplying 
HBM function to mesenchymal stem cells (Onyia et al, J. Bone Miner. Res., 13:20-30 
(1998); Ko et al.. Cancer Res., 56:4614-4619 (1996)). Supplying such a fimction provides 
10 protection against osteoporosis. For regulating Upid levels, HBM function can be suppUed to 
liver cells, as well as other cells involved in Upid metabolism and lipid regulation (e-g., 
muscle cells, lesion cells, Upid laiden foam cells and megakaryoblasts). The HBM gene or a 
part of the gene may be introduced iuto the ceU in a vector such that the gene remains 
extrachromosomaL In such a situation, the gene will be expressed by the cell from the 
15 extrachromosomal locadon. 

Vectors for introduction of genes both for recombination and for extrachromosomal 
maintenance are known in the art, and any suitsfcle vector may be used. Methods for 
introducing DNA into cells such as electroporation, calcium phosphate co-precipitation, and 
viral transduction are known m the art, and the choice of method is wilhm the competence of 
20 one skilled in tiie art (Robbins, Ed., Gene Therapy Protocols, Human Press, (1997)). 
Cells transformed with the HBM gene can be used as model systems to study osteoporosis 
and dmg treatments that promote bone growth as well as to study lipid-mediated diseases. 

As generally discussed above, the HBM gene or fragment, where appUcable, may be 
used in gene therapy methods in order to increase the amount of the expression products of 
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such genes in mesenchymal stem cells or in other ceUs. It may be useful also to increase the 
level of expression of a given HBM protein, or a fragment thereof, even in those cells iii 
which the wild type gene is expressed nomially. Gene therapy would be carried out 
according to generaUy accepted methods as described by. for example, Friedman, nerapyfo 
5 Genetic Diseases, Friedman. Ed., Oxford University Press, pages 105-121 (1991). 

A virus or plasmid vector containing a copy of the HBM gene linked to expression 
control elements and capable of repHcating inside mesenchymal stem ceUs or liver cells, is 
prepared. Suitable vectors are known and described, for example, in U.S. Patent No. 
5,252,479 and WO 93/07282, the disclosures of which are incorporated by reference herein in 
10 their entirety. ^Tie vector is then injected mto the patient, either locally into the bone marrow 
or Hver, or systemically (in order to reach any mesenchymal stem ceUs located at other sites, 
Le., in the blood). If the transfected gene is not permanently incorporated into the genome of 
each of the targeted cells, the treatment may have to be repeated periodicaUy. 

Gene transfer systems known in the art may be useful in the practice of the gene 
15 ther^y methods ofthe present invention. These include viral and non-viral transfer methods. 
A number of viruses have been used as gene transfer vectors, including polyoma, Le.. SV40 
(Madzak et al., J. Gen. Virol., 73:1533-1536 (1992)), adenoyirus (Beriaier, Cwr. Top. 
Microbiol. Immunol., 158:39-61 (1992); Beikner et aL, Bio Techniques, 6:616-629 (1988); 
Gor2igliaera/..j: Virol, 66:A401^12 {I992)i et al, Proc. NatL Acad. ScL USA, 

20 89:2581-2584 (1992); Rosenfeld et al. Cell, 68:143-155 (1992); Wilkinson et al, Nucl. 
Acids Res., 20:2233-2239 (1992); Stratford-Penicaudet et oL, Hvm. Gene Ther., 1-.241-256 
(1990)). vaccinia virus (Mackett et al. Biotechnology, 24:495-499 (1992)), adeno-associafed 
virus (Muzyczka, Curr. Top. Microbiol Immunol, 158:91-123 (1992); Ohi et al. Gene, 
89:279-282 (1990)), herpes viruses including HSV and EBV (Margolskee. Curr. Top. 
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Microbiol. Immunol., li«:67-yu Joimson et aL, J. Virol., 66:2952-2965 (1992); FinJc 

etal.. Hum. Gene Ther., 3:11-19 (1992); Breakfield etal., Mol. NeurobioL, 1:337-371 
(1987;) Fresse et al.,£iochem. Pharmacol., 40:2189-2199 (1990)), and retroviruses of avian 
(Brandyopadhyay et aL, Mol. Cell Biol., 4:749-754 (1984); Petropouplos et al, J. Virol, 
5 66:3391-3397 (1992)), murine (Miller, Curr. Top. Microbiol. Immunol, 158:1-24 (1992); 
MiUer et al., Mol Cell Biol, 5:431-437 (1985); Serge et al, Mol Cell Biol, 4:1730-1737 
(1984); Mam et al, J. Virol, 54:401-407 (1985)), and human origin (Page et al, J. Virol, 
64:5370-5276 (1990); Buchschalcher a/., Virol, 66:2731-2739 (1992)). Most human 
gene therapy protocols have been based on disabled murine retroviruses. 
10 Non-viral gene transfer methods known in the art include chemical techniques such as 

calcium phosphate coprecipitation (Graham et al. Virology, 52:456-467 (1973); Pemcer et 
al:. Science, 209:1414-1422 (1980)), mechanical techniques, for example microinjection 
(Anderson et al, Proc. Natl Acad. Sci. USA, 77:5399-5403 (1980); Gordon et al, Proc. Natl 
Acad. Sci. USA, 77:7380-7384 (1980); Brinster et al. Cell, 27:223-231 (1981); Constantini et 
15 al. Nature, 294:92-94 (1981)), membrane fiision-mediated transfer via liposomes (Feigner et 
al, Proc. Natl Acad Sci. USA, 84:7413-7417 (1987); Wang et al. Biochemistry, 28:9508- 
9514 (1989); Kaneda et al, J. Biol Chem., 264:12126-1212.9 (1989); Stewart et al. Hum. 
Gene Ther., 3:267-275 (1992); Nabel et al. Science, 249:1285-1288 (1990); Lim et al. 
Circulation, 83:2007-2011 (1992)), and direct DNA uptake and receptor-mediated DNA 
20 transfer (Wolff al. Science, 247:1465-1468 (1990); Wu et al., BioTechniques, 11:474-485 
(1991); Zenke et al , Proc. Natl Acad. Sci. USA, 87:3655-3659 (1990); Wu et al, J. Biol 
Chem., 264:16985-16987 (1989); Wolff ef al, BioTechniques, 11:474-485 (1991); Wagner 
et al, 1990; Wagner et al, Proc. Natl Acad. Sci. USA, 88:4255-4259 (1991); Gotten et al, 
Proc. Natl Acad. Sci. USA, 87:4033-4037 (1990); Curiel et al., Proc. Natl. Acad. Set USA, 
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88:8850-8854 (1991); Cuiiel etal.,Hum. Gene Ther., 3:147-154 (1991)). Viral-mediated 
geae transfer can be combined with direct in vivo vectors to the mesenchymal stem cells and 
not into the surrounding cells (Romano al.. In Vivo, 12(l):59-67 (1998); Gonez et al., 
Hum.Mol. Genetics, 7(12):19n.9 (1998)). Alternatively, the retroviral vector producer ceU 
5 line can be injected into the bone marrow (Culver et al.. Science, 256: 1550-1552 (1992)). 
Injection of producer cells would then provide a continuous source of vector particles. This 
technique has been approved for use in humans with inoperable brain tumors. 

In an ^proach which combines biological and physical gene transfer methods, 
plasmid DNA of any size is combined with apolylysine-conjugated antibody specific to the 
10 adenovirushexonprotem.andtheresultingcomplexisboundtoanadenovirusvector. THe 
trimolecular complex is then used to infect cells. The adenovirus vector pemnts efficient 
binding, internalization, and degradation of the endosome before the coupled DNA is 
damaged. 

Liposome/DNA complexes have been shown to be capable of mediating direct in vivo 
15 gene transfer. WhUe in standard liposome preparations the gene transfer process is non- 
specific, locahzed in vivo uptake and expression have been reported in tumor deposits, for 
example, following direct in situ administration (Nabel, Hum.. Gene Ther., 3:399-410 (1992)).. 

XVm. Methods of Use: Transformed Hosts, Development of Pharmaceuticals and 
Research Tools 



20 



Cells and animals that cany the HBM gene can be used as model systems to study and 
test for substances tiiat have potential as therapeutic agents (Qnyia et al, J. Bone Miner. Res., 
13:20-30 (1998); Broder etal.. Bone, 21:225-235 (1997)). The cells are typically cultured 
mesenchymal stem cells or liver cells. These may be isolated from individuals with somatic 
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or germline HBM genes. Alternatively, the cell line can be engineered to carry the HBM 
gene, as described above. After a test substance is applied to the cells, the transformed 
phenotype of the cell is determined. Any trait of transformed cells can be assessed, including 
formation of bone matrix in culture (Broder et al.. Bone, 21:225-235 (1997)), mechanical 
5 properties (Kizer et al., Proc. Natl. Acad. Sci. USA, 94:1013-1018 (1997)), and response to 
application of putative therapeutic agents. 

Animals for testing therapeutic agents can be selected after treatment of germline cells 
or zygotes. Such treatments include insertion of the Zmaxl gene, as well as insertion of the 
/ZBJW gene and disrupted homologous genes. Alternatively, the inserted Zwoxi gene(s) 
10 and/or HBM gene(s) of the animals may be disrapted by insertion or deletion mutation of 
other genetic alterations using conventional techniques, such as those described by, for 
example, Capechi, Science, 244:1288 (1989); Valancuis et al., MoL Cell Biol, 11:1402 
(1991); Hasty et al.. Nature, 350:243 (1991); Shinkai et al. Cell, 68:855 (1992); Mombaerts 
et al. Cell, 68:869 (1992); Philpott et al. Science, 256:1448 (1992); Snouwaert et al, 
15 Science, 257:1083 (1992); Donehower et al. Nature, 356:215 (1992). After test substances 
have been administered to the animals, the growth of bone or modulation of Upids must be 
assessed. If the test substance enhances the growth of bone . or regulates lipid levels, then the 
test substance is a candidate ther^eutic agent. These animal models provide an extremely 
important vehicle for potential therapeutic products. Preferred models for studying Hpid 
20 modulation include mice (Smith et al. J. Intern. Med., 242: 99-109 (1997)) and guinea pigs. 

Individuals carrying the HBM gene have elevated bone mass and altered lipid levels 
as discussed in the example below. The HBM gene causes this phenotype by altering the 
activities, levels, expression patterns, and modification states of other molecules involved in 
bone development. Using a variety of estabUshed techniques, it is possible to identif/ 
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molecules, preferably proteins or mRNAs, whose activities, levels, expression patterns, and 
modification states are different between systems containing the Zmax 1 gene and systems 
containing the HBM gene. Such systems can be, for example, cell-free extracts, cells, tissues 
or living organisms, such as mice or humans. For a mutant form of Zmaxl, a complete 
5 deletion of Zmaxl , mutations lacking the extraceUular or intraceUular portion of the protein, 
or any other mutation in the Zmaxl gene may be used. It is also possible to use expression of 
antisense Zmaxl RNA or oligonucleotides to inhibit production of the Zinaxl protein. For a 
mutant fomi of HBM, a complete deletion of HBM. mutations lacking the extracellular or 
intraceUular portion of the HBM protein, or any otiier mutation in the HBM^^^ may be 
10 used. It is also possible to use expression of antisense HBM RNA or oUgonucleotides to 
• inhibit production of the HBM protein. 

Molecules identified by comparison of Zmaxl systems and HBM systems can be used 
as surrogate markers in pharmaceutical development or in diagnosis of human or animal bone 
disease. Alternatively, such molecules may be used in treatment of bone disease. See, 
15 Schena et al. Science, 270:467-470 (1995). 

For example, a transgenic mouse carrying the HBM gone in the mouse homologue is 
constructed. A mouse of die genotype HBM/+ is viable, healthy and has elevated bone mass. 
To identify sum.gate markers for elevated bone mass. HBM-/+ (i.e.. heterozygous) and 
isogenic +/+ (i.e.. wild-type) mice are sacrificed. Bone tissue mRNA is extracted from each 
20 animal, and a "gene chip" corresponding to mRNAs expressed in the +/+ individual is 
constructed. mRNA from different tissues is isolated from animals of each genotype, 
reverse-transcribed, fluorescently labeled, and then hybridized to gene fragments afBxed to a 
soUd support The ratio of fluorescent intensity between the two populations is indicative of 
the relative abundance of the specific mKNTAs in the +/+ and HBM/+ animals. Genes 
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encoding mRNAs over- and under-expressed relative to tiie wild-type control axe candidates 
for genes coordinate^ regulated by the HBM gene. This strategy can be sinularly used to 
study lipid regulation. 

Mice also serve as the most common experimental animal model for atherosclerosis 
5 research. There are at least three ways of inducing atherosclerosis in mice: (1) diet induced, 
apoE deficiency-induced and LDL receptor-deficiency induced. The methods for using a 
' mouse model for testing, agents which modulate Upid levels in vivo dan be performed as 
described in Smith fl/., J. /nrem. M2^ 242: 99-109 (1997). 

One standard procedure for identification of new proteins that are part of the same 
10 signaling cascade as an ahready-discovered protein is as follows. Cells are treated with 

radioactive phosphorous, and the already-discovered protein is manipulated to be more or less 
active. The phosphorylation state of other proteins in the cell is then monitored by 
polyaciylamide gel electrophoresis and autoradiography, or similar techniques. Levels of 
activity of the known protein may be manipulated by many methods, including, for example, 
15 comparing wild-type mutant proteins using specific inhibitors such as drugs or antibodies, 
simply adding or not adding a known extracellular protein, or using antisense inMbition of 
the expression of the known protein (Tamura et al, defence, 280(5369): 1614-7 (1998); 
MsD^^EMBOJ., 17(15):4391-403 (1998); Cooper era/.. Cell, 1:263-73 (1982)). 

In another example, proteins Avith different levels of phosphorylation are identified in 
20 TE85 osteosarcoma cells expressing either a sense or antisense cDNA for Zmaxl . TE85 cells 
normally express high levels of Zmaxl (Dong et al, Biochem. <&. Biophys. Res. Comm., 
25 1 :784-790 (1 998)). Cells containing the sense construct express even hi^er levels of 
Zmaxl, while cells expressing the antisense construct express lower levels. Cells are grown 
in the presence of ^^P, harvested, lysed, and the lysates run on SDS polyacrylamide gels to 
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separate protems. and the gels subjected to autoradiography (Ausubel et al. Current 
Protocols in Molecular Biology, John Wiley & Sons (1997)). Bands that differ in intensity 
between the sense and antisense cell lines represent phosphoproteins whose phosphorylation 
state or absolute level varies in response to levels of Zmaxl . As an alternative to the ^^p. 
5 labeling, unlabeled proteins may be separated by SDS-PAGE and subjected to 

imniTmoblotting, using the connnerciaUy available anti-phosphotyrosine antibody as a probe 
Cniomas al.. Nature, 376(6537):267.71 (1995)). As an alt«native to the expression of 
antisense RNA, transfection with chemically modified antisense ohgonucleotides can be used 
(Woolf e/ al. Nucleic Acids Res., 18(7):1763-9 (1990)). 
10 Many bone disorders, such as osteoporosis, have a slow onset and a slow response to 

treatment. It is therefore useful to develop surrogate markers for bone development and 
minerahzation. Such markers can be useful in developing treatments for bone disorders, and 
for diagnosing patients who may be at risk for later development of bone disorders. 
Examples of preferred markers are N- and C-terminal telopeptide markers described, for 
15 example, in U.S. Patent Nos. 5,455,179, 5.641.837 and 5,652,1 12, the disclosures of which 
are incorporated by reference herein in fteir entirety. In the area of HIV disease, CD4 counts 
and viral load are useful surrogate markers for disease progression (Vlahov et al, JAMA, 
279(l):35-40 (1998)). There is a need for analogous surrogate markers in the area of bone 
disease. 

20 A Seagate marker can be any characteristic that is easily tested and relatively 

insensitive to non-specific influences. For exanqile. a surrogate marker can be a molecule 
such as a protein or mRNA in a tissue or in blood serum. Alternatively, a surrogate maricer 
may be a diagnostic sign such as sensitivity to pain, a reflex response or the like. 
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In yet another example, surrogate markers for elevated bone mass are identified using 
a pedigree of humans carrying the HBM gene. Blood samples are withdrawn from three 
individuals that carry the HBM gene, and from three closely related individuals that do not. 
Proteins in the serum from these individuals are electrophoresed on a two dimensional gel 
system, in which one dimension separates proteins by size, and another dimension separates 
proteins by isoelectric point (Epstein et al. Electrophoresis, 17(ll):1655-70 (1996)). Spots 
corresponding to protems are identified. A few spots are expected to be present in different 
amounts or in slightly different positions for the HBM individuals compared to their normal 
relatives. ' These spots correspond to proteins that are candidate surrogate markers. The 
identities of the proteins are determined by microsequencing, and antibodies to the proteins 
can be produced by standard methods for use in diagnostic testing procedures: Diagnostic 
assays for HBM proteins or other candidate surrogate markers include using antibodies 
described in this invention and a reporter molecule to detect HBM in human body fluids, 
membranes, bones, cells, tissues or extracts thereof. The antibodies can be labeled by joining 
tiiem covalently or noncovalently with a substance that provides a detectable signal. In many 
scientific and patent Uterature, a variety of reporter molecules or labels are described 
including radionucUdes, enzymes, fluorescent, chemi-luminescpnt or chromogenic agents 
(U.S. Patent Nos. 3,817,837; 3,850,752; 3,939.350; 3.996,345; 4,277,437; 4,275,149; and 
4,366,241). 

Using these antibodies, the levels of candidate surrogate markers are measured in 
nomial individuals and in patients suffering from abone disorder, such as osteoporosis, 
osteoporosis pseudogHoma, Engehnami's disease. Ribbing's disease, hypeiphosphatasemia, 
VanBuchem's disease, melorheostosis, osteopetrosis, pychodysostosis, sclerosteosis, 
osteopoikilosis, acromegaly, Paget' s disease, fibrous dysplasia, tubular stenosis, osteogenesis 
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imperfecta, hypoparathyroidism, pseudohypoparathyroidism, 
pseudopseudohypoparathyroidism, primary and secondary hyperparathyroidism and 
associated syndromes, hypercalciuiia, medidlary carcinoma of the thyroid gland, 
osteomalacia and other diseases including Upid-mediated diseases. Techniques for measuring 
5 levels of protein in serum in a clinical setting using antibodies are well established. A protem 
that is consistently present in higher or lower levels in individuals carrying a particular 
disease or type of disease is a useful surrogate marker. 

A surrogate marker can be used in diagnosis of a bone disorder. For example, 
consider a child that present to a physician with a high ftequency of bone fracture. The 
10 underlying cause may be child abuse, inappropriate behavior by the child, or a bone disorder. 
To rapidly test for a bone disorder, the levels of the surrogate marker protein are measured 
using the antibody described above. 

Levels of modification states of surrogate markers can be measured as indicators of 
the likely effectiveness of a drug that is being developed. It is especially convenient to use 
15 suiTOgate markers in creating treatments for bone disorders, because alterations in bone 

development or mineralization may require a long time to be observed. For example, a set of 
bone mRNAs, termed the "HBM-inducible mKNA set" is found to be overexpressed in 
HBM/+ mice as compared to +/+ mice, as described above. Expression of this set can be 
used as a surrogate niarker. SpecificaUy, if treatment of +/+ mice with a compound results in 
20 overexpression of the HBM-inducible mKNA set, then that compound is considered a 
promising candidate for furtitier development 

This iQvention is particularly usefiil for screening compounds by using the Zmaxl or 
HBM protein or binding fragment thereof in any of a variety of drug screening techniques. 
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The Zmaxl or HBM protein or fragment employed in such a test may either he free in 
solution, affixed to a solid stq)port, or borne on a cell surface. One method of drug screening 
utilizes eukaryotic or prokaryotic host cells which are stably transformed with recombinant 
nucleic acids expressing the protein or fragment, preferably in competitive binding assays. 

5 Such cells, either in viable or fixed form, can be used for standard binding assays. One may 
measure, for example, for the formation of complexes between a Zmaxl or HBM protein or 
fragment and the agent being tested, or examine the degree to which the fonnation of a 
complex between a Zmaxl or HBM protein or fragment and a known Ugand is interfered with 
by the agent being tested. 

10 Thus, the present invention provides methods of screening for drugs comprising 

contacting such an agent with a Zmaxl or HBM protein, or fragment thereof and assaying (i) 
for the presence of a complex between the agent and the Zmaxl or HBM protein or fragment, 
or (ii) for the presence of a conq)lex between the Zmaxl or HBM protein or fragment and a 
ligand, by methods weU known in the art. In such competitive binding assays the Zmaxl or 

15 HBM protein or fragment is typicaUy labeled. Free Zmaxl or HBM protein or fragment is 
separated from that present in a protein:protein complex, and the amount of free (i.e., 
uncomplexed) label is a measure of the binding of the agent.being tested to Zmaxl or HBM 
or its interference with Zmaxl or HBM: Ugand bindmg, respectively. 

Another technique for drug screening provides high throughput screening for " 

20 compoundshavingsuitablebindingafBnitytotheZmaxlorHBMproteinsandisdescribed 

in detail in WO 84/03564. Briefly stated, large numbers of different small peptide test 
compounds are synthesized on a soUd substrate, such as plastic pins or some other surface. 
The peptide test compounds are reacted with Zmaxl or HBM proteins and washed. Bound 
Zmaxl or HBM protein is then detected by methods well known in the art. Purified Zmaxl 
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or HBM can be coated directly onto plates for use in the aforementioned drug screening 
techniques. However, non-neutralizing antibodies to the protein can be used to capture^ 
antibodies to inunobili^e the Zmaxl or HBM protein on the solid phase. 

Tins invention also contemplates the use of competitive drug screening assays in 
5 which neutralizing antibodies capable of specifically binding the Zmaxl or HBM protein 
compete with a test compound for binding to the Zmaxl or HBMpiotein or fragments 
thereof. In this mamier, the antibodies can be used to detect the presence of any peptide that 
shares one or more antigenic determinants of the Zmaxl or HBM protein. 

A further technique for drug screening involves the use of host eukaiyotic cell lines or 
10 cells(suchasdescn^edabove)thathaveanonfunctionalZmaxlor^Mgene. Tlxeschost 
cell lines or cells are defective at the Zmaxl cr HBM protein level. The host cell lines or 
cells are grown in the presence of -drug compound. Th^ rate of growth of the host cells or 
impact on lipid metabolism is measured to detemiine if the compound is capable of 
regulating the growth or lipid metaboHsm of Zmaxl or HBM defective cells. 

The goal of rational drjig design is to produce stmctural analogs of biologically active 
proteins of interest or of smaU molecules with which they interact (e.g., agonists, antagonists, 
inhibitors) in order to fashion drugs which are. for example..more active or stable forms of 

theprotein.orwWch,e.g.,enhanceorinterferewiththefimctionofaproteinz«vfv^ See. 
e-g.. Hodgson. Bio/Technolosy, 9:19-21 (1991). In one approach, one first determines the 

20 toe-dimensionalstmctureofaproteinofinterest(e.g..ZmaxlorHBMprotein)or.for 
example, of the Zmaxl- or HBM-receptor or ligand complex, by X-ray crystallography, by 
computer modeling or most typically, by a combination of approaches. Less often, usefiil 
information regarding the stmcture of a protein may be gained by modeling based on the 
structure of homologous proteins. An example of rational dmg design is the development of " 
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HIV protease inhibitors (Erickson et al.. Science. 249:527-533 (1990)). In addition, peptides 
(e.g., Zmaxl or HBM protein) are analyzed by an alanine scan (Wells, Methods in Eivzymol, 
202:390-41 1 (1991)). In this technique, an amiao acid residue is replaced by Ala, and its 
effect on the peptide's activity is determined. Each of the amino acid residues of the peptide 

5 is analyzed m tiiis manner to determine the important regions of the peptide. 

It is also possible to isolate a target-specific antibody, selected by a functional assay, 
and then to solve its crystal structure. lu principle, this approach yields a phaimacore upon 

• which subsequent drug design can be based. It is possible to bypass protein crystaUography 
altogether by generating anti-idiotypic antibodies (anti-ids) to a functional, pharmacologically 

10 active antibody. As a mirror image of a minor image, the bindmg site of the anti-ids would 
be expected to be an analog of the original receptor. The anti-id could then be used to 
identify and isolate peptides ftom banks of chemically or biologicaUy produced banks of 
peptides. Selected peptides would then act as the phaimacore. 

Thus, one may design drugs which have, e.g., improved Zmaxl or HBM protein 

15 activity or stability or which act as inhibitors, agonists, antagonists, etc. of Zmaxl or HBM 
protein activity. By virtue of the availability of cloned Zmaxl or HBM sequences, sufficient 
amounts of the Zmaxl or HBM protein may be made available to perform such analytical 
studies as X-ray crystaUography. M addition, the knowledge of the Zmaxl or HBM protein 
-sequence provided herein will guide those employing computer modeling techniques in place 

20 of; or in addition to x-ray caystallography. 

XIX. Methods of Use: Avian and Mammalian Animal Husbandry 

The Zmaxl DNA and Zmaxl protein and/or the HBM DNA and HBM protein can be 
used for vertebrate and preferably human therapeutic agents and for avian and mammaUan 
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veterinary agents, mcludang for livestock breeding. Animals contemplated as subjects 
include livestock (e.g.. cattle, pigs, sheep, goats, horses, buffalo, etc.), primates, canines, 
felines, rodents, birds, as well as reptiles, fish, and amphibians. Birds, including, for 
example, chickens, roosters, hens, turkeys, ostriches, ducks, pheasants and quails, can benefit 
5 fromtheidentificationofthegeneandpathwayforhighbonemass. In many examples cited 
iaUterature (for example. McCoy ./.. Res. Vet. ScL, 60(2):185-186 (1996)). weakened 
bones due to husbandry conditions cause cage layer fatigue, osteoporosis and high mortahty 
- rates. Additional therapeutic agents to treat osteoporosis or other bone disorders in birds 
have considerable beneficial effects on avian welfare and the economic conditions of the 
10 - livestock industry, includmg, for example, meat and egg production. 



can 



XX. Methods of use: Diagnostic assays using Zmaxl-specific oUgonucleotides for 
detection of genetic alterations affecting bone development and Upid regulation. 

In cases where an alteration or disease of bone development or lipid metabolism is 
suspected to involve an alteration of the Zmaxl gene or the /©AT gene, specific 
15 oligonucleotides may be constructed and used to assess the level of Zmaxl mRNA or HBM 
mRNA, respectively, in bone tissue or in another. tissue that.affects bone development 

For example, to test whether a person has the HBM gen^, which affects bone density 
and lipid regulation, polymerase chain reaction can be used. Two oligonucleotides are 
synthesized by standard methods or are obtained from a commercial siq,pHer of custom-made 
20 oligonucleotides. The length and base composition are determined by standard criteria using 
the OHgo 4.0 primer Picking program (Wojchich Rychlik, 1992). One of the 
ohgonucleotides is designed so that it will hybridize only to HBM DNA under the PGR 
conditions used. The other oHgonucleotide is designed to hybridize a segment of Zmaxl 
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genomic DNA, such that ampHfication of DNA using these oligonucleotide primers produces 
a conveniently identified DNA firagment. For example, the pair of primers 
CCAAGTTCTGAGAAGTCC (SEQ ID NO:32) and AATACCTGAAACCA TACCTG 
(SEQ ID NO:33) wiU amplify a 530 base pair DNA firagment from a DNA sample when the 
5 following conditions are used: step 1 at 95«'C for 120 seconds; step 2 at 95 °C for 30 seconds; 
step 3 at 5S°C for 30 seconds; step 4 at 72°C for 120 seconds; where steps 2-4 are repeated ' 
t 35 times. Tissue samples may be obtained from hair follicles, whole blood, or the buccal 
cavity. 

The fragment generated by the above procedure is sequenced by standard techniques. 
10 Individuals heterozygous for the HBM gene wiU show an equal amount of G and T at tiie 
second position in the codon for glycine 171. Normal or homozygous wild-type individuals 
will show only G at this position. 

Other ampKfication fechniques besides PGR may be used as alternatives, such as 
Ugation-mediated PGR or techniques involving Q-beta repUcase (Cahill et al., Clin. Chem., 
15 37(9):1482-5 (1991)). For example, the ohgonucleotides AGCTGCTCGTAGCTGTCTCT 
CCCTGGATCACGGGTACATGTACTGGACAGACTGGGT (SEQID NO:34) and 
TGAGACGCCCCGGATTGAGCGGGCAGGGATAGCTTATTCCCTGTGCCGCATTACG 

GC (SEQ ID NO:35) can be hybridized to a denatured human DNA sample, treated with a 
DNA hgase, and then subjected to PGR amplification using the primer ohgonucleotides 
20 AGCTGCTCGTAG CTGTCTCTCCCTGGA (SEQ ID NO:36) and 

GCCGTAATGCGGCACAGGGAATAAGCT (SEQ ID NO:37). In the first two 
oligonucleotides, the outer 27 bases are random sequence corresponding to primer binding 
sites, and the inner 30 bases correspond to sequences in the Zmaxl gene. The T at the end of 
the first oHgonucleotide corresponds to theiiZBMgene. The first two ohgonucleotides are 
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ligated only when hybridized to human DNA canying the ^Afgene. which results in the 
formation of an amplifiable 11 4 bp DNA fragment. 

Products of amplification can be detected by agarose gel electrophoresis, quantitative 
hybridization, or equivalent techniques for nucleic acid detection know to one sldUed in the 
5 art of molecular biology (Sambrooker a/., Mofec«/arC/om>z^.-^i^^^^ Cold 
Spring Harbor Laboratory, Cold Spring, NY (1989)). 

Other alterations in the Zmaxl gene or the ^ffiil/gene may be diagnosed by the same 
type of amplification-detection procedures, by using oligonucleotides designed to identify 
those alterations. These procedures can be used in animals as weU as hmnans to identify 
10 alterationsin Zmaxl or HBM that affect bone development and/or Hpid metaboKsm or levels. 
Expression of Zmaxl or HBM in bone tissue may be accomplished by fiasihg the 
cDNAofZmaxl or HBM, respectively, to a bone-specific promoter in the contextof a vector 
for genetically engineering vertebrate cells. DNA constructs are introduced into cells by 
packaging the DNA into virus capsids, by the use of cationic Uposomes, electropomtion, or 
15 by calcium phosphate transfection. Transfected cells, preferably osteoblasts, may be studied 
in culture or may be introduced into bone tissue m animals by direct injection into bone or by 
intravenous injection of osteoblasts, foUowed by incorporation into bone tissue (Ko a/.. 
Cancer ResearcK 56(20):4614-9.(1996)). For example," the osteocalcin promoter, which is 
specificaUy active in osteoblasts, may be used to direct transcription of the Zmojcl gene or the 
20 HBil^gene. Any of several vectors and transfection methods may be used, such as retroviral 
vectors, adenovirus vectors, or vectors that are maintained after transfection using cationic 
Kposomes, or other methods and vectors described herein. 

Similarly Zmaxl, or HBM can be expressed in Hver tissue or in other Hpid- 
metabolism or Hpid-regulating cells, such as hpid laden foam ceUs or lesion cells. This can 
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be accompHshed by fusing the cDNA of Zmaxl or HBM respectively to, for example, a Uver 
specific promoter or other suitable promoter in the context of a vector for genetically 
engineering vertebrate cells. DNA constructs are introduced into cells by packaging the DNA 
into, for example; virus capsids, by the use of cationic hposomes, electroporation, or calcium 
. 5 phosphate transfection. The transfected cells, preferably Hver cells, may be studied in culture 
or can be introduced into animals by direct injection into the liver or other cell mvolved in 
Upid regulation or metaboUsm The vectors and bransfection methods to be used are similar 
to those described herein. 

Alteration of the level of functional Zmaxl protein or HBM protein affects the level 
10 ofbone mineralization and Upid levels. By manipulating levels of functional Zmaxl protein 
or HBM protein, it is possible to affect bone development and to increase or decrease levels 
ofbone mineralization as well as Upid levels. For example, it may be useful to increase bone 
mineraUzation in patients with osteoporosis. Alternatively, it may be useful to decrease bone 
mineraUzation in patients with osteopetrosis or Pagefs dis^e. Alteration of Zmaxl levels or 
15 HBM levels can also be used as a research tooL SpecificaUy, it is possible to identify 

proteins, mRNA and other molecules whose level or modification status is altered in response 
to changes in functional levels of Zmaxl or HBM. The patiiology and pathogenesis ofbone 
disorders is known and described, for example, in Rubin and Farber (Eds.), Pathology, 2nd 
Ed., S3. Lippincott Co., Philadelphia, PA (1994). 
20 Zmaxl or HBM protein levels can be altered to regulate Upid levels in a cell or a 

subject The pathology and pathogenesis of atherosclerosis and arteriosclerosis is known and 
described, for example, in Edwin L.Biennan, "Atherosclerosis and Other Forms of • 
Arteriosclerosis," ia Harrison's Principles of Internal Medicine, 1106-1116 (13th Ed., 1994). 
Modulation of Upid levels may be usefiil to lower certain levels of Upids {e.g.. LDL) in 
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patients with arteriosclerosis and/or atherosclerosis, as well as conditions and diseases 
afBHated with atherosclerosis and arteriosclerosis, as described by Bierman (1994). 

A variety of techniques can be used to alter the levels of functional Zmaxl or HBM. 
For example, intravenous or intraosseous injection of the extraceUular portion of Zmaxl or 
5 mutations thereof or HBM ormutations thereof wiU alter the level of Zmaxl activity or 
HBM activity, respectively, in the body of the treated human, animal or bird. Truncated 
versions of the Zmaxl protein or HBM protein can also be injected to alter the levels of 
functional Zmaxl protem or HBM protein, respectively. Certain forms of Zmaxl or HBM 
enhance the activity of endogenous protein, while other forms are inhibitory. 
10 In a preferred embodiment, the HBM protein is used to treat osteoporosis or 

arteriosclerosis. In a further preferred embodiment, the extracellular portion of the HBM 
protein is used. This HBM protein may be optionally modified by the addition of a moiety 
that causes the protein to adhere to the surfece of cells. The protein is prepared in a 
phannaceutically acceptable solution and is administered by injection or another method that 
15 achieves acceptable pharmacokinetics and distribution. 

In a second embodiment of this method, Zmaxl or HBM levels are increased or 
decreased by gene ther^y techniques. To increase Zmaxl^pr HBM levels, osteoblasts or 
another useful ceU type are geneticaUy engineered to express high levels of Zmaxl or HBM 
as described above. Alternatively, to decrease Zmaxl or HBM levels, antisense constructs 

20 thatspecificaUyreducetheleveloftranslatableZmaxlorHBMmRNAcanbeused. In 
general, a tissue-nonspecific promoter may be used, such as the CMV promoter or another 
commerciaDy available promoter found in expression vectors (Wu et at. , Toxicol. Appl 
Pharmacol, 141(l):330-9 (1996)). In a preferred embodiment, a Zmaxl cDNA or its 
antisense is transcribed by a bone-specific promoter, such as the osteocalcin or another 
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promoter, to achieve specific expression in bone tissue. In this way, if a Zmaxl -expressing 
DNA construct or HBM-expressing construct is introduced into non-bone tissue, it will not be 
expressed. Similarly, if a liver-specific promoter is used to express the HBM or Zmaxl 
proteins in Uver or other cell involved in lipid regulation or metaboHsm, the DNA construct 
5 with, for example, a liver-specific promoter wiQ not be e3q)ressed in other non-liver tissues. 

In a third embodiment of this method, antibodies against Zmaxl or HBM are used to 
inhibit its fimction. Such antibodies are identified herein. 

In a fourth embodiment of this method, drugs that inhibit Zmaxl fimction or HBM 
fimction are used. Such drugs are described herein and optimized according to techniques of 
10 medicinal chemistry weU known to one skilled ni the art of pharmaceutical development. 

Zmaxl and HBM interact with several proteins, such as ApoE. Molecules that inhibit 
the interaction between Zmaxl or HBM and ApoB or another binding partner are expected to 
alter bone development and mineralization. Such inhibitors may be usefijl as drugs in the 
treatment of osteoporosis, osteopetrosis, or other diseases of bone mineralization. Such 
15 inhibitors may be low molecular weight compounds, proteins or other types of molecules. 
See, Kim et al, J. Biochem. (Tokyo), 124(6): 1072-1076 (1998). 

Inhibitors of the interaction between Zmaxl or HBM and interacting proteins may be 
isolated by standard dmg-screening techniques. For example, Zmaxl protein, (or a fragment 
thereof) or HBM protein (or a firagment thereof) can be immobilized on a solid support such 
20 as the base of microtiter well. A second protein or protein fragment, such as ApoE is 

derivatized to aid in detection, for example with fluorescein. Iodine, or biotin, then added to 
the Zmaxl or HBM in the presence of candidate compounds that may specifically inhibit this 
protein-protein domain of Zmaxl or HBM, respectively, and thus avoid problems associated 
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With its transmembrane segment Drug screens of this type are weU known to one skiUed in 
the art of pharmaceutical development. 

Because Zmaxl and HBM are involved in bone development and lipid regulation, 
proteins that bind to Zmaxl and HBM are also expected to be involved in bone development 
5 and hpid regulation. Such binding proteins can be identified by standard methods, such as 
co-immunoprecipitation, co-fractionation, or the two-hybiid screen (Ausubel et aL, Current 
Protocoh in Molecular Biology, John Wiley & Sons (1997)). For example, to identify 
Zmaxl-interacting proteins or HBM-interacting proteins using the two-hybrid system, the 
extracellular domain of Zmaxl or HBM is fused to LexA and expressed for the yeast vector 
10 PEG202 (the "bait") and expressed in the yeast strain EGY48. The yeast strain is transformed 
with a "prey" library in the appropriate vector, which encodes a galactose-inducible 
transcription-activation sequence fused to candidate interacting proteins. The techniques for 
initially selecting and subsequently verifying interacting proteins by this method are well 
known to one skilled in the art of molecular biology (Ausubel et al. Current Protocols in 
15 Molecular Biology, John Wiley & Sons (1997)). 

In a prefenred embodiment, proteins that mteract with HBM, but not Zmaxl, are 
identified using a variation of the above procedure (Xu etal_^ Proc. Natl Acad. Set USA. 
94(23):12473-8 (Nov. 1997)). This variation of the two^hybrid system uses two baits, and 
Zmaxl and HBM are each fused to LexA and Te®, respectively. Alternatively, proteins that 
20 interactwiththeHBMbutnotZmaxlarealsoisolated. These procedures are well known to 
one skiUed in the art of molecular biology, and are a simple variation of standard two-hybrid 
procedures. 

As an alternative method of isolating Zmaxl or HBM interacting proteins, a 
biochemical approach is used. The Zmaxl protein or a fragment thereof such as the 
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extraceUular domain, or the HBM protein or a fragment thereof, such as the extracelMar 
domain, is chemicaUy coupled to Sephaxose beads. The Zmaxl- or HBM-coupled beads are 
poured into a column. An extract of proteins, such as serum proteins, proteins in the 
supernatant of a bone biopsy, or intraceUular proteins fiom gently lysed TE85 osteoblastic 

5 cells, is added to the column. Non-specifically bound proteins are eluted. the column is 

washed several times with a low-salt buffer, and then tightly binding proteins are eluted with 
ahigh-salt buffer. These.are candidate proteins that bind to Zmaxl or HBM, and can be 
tested for specific binding by standard tests and conlrol experiments. Sepharose beads used 
for coupling proteins and the methods for perfomiing the coupling are commercially 

10 available (Sigma), and the procedures described here are well known to one skiUed in the art 

of protein biochemistry. 

As a variation of the above procedure, proteins that are eluted by high salt firom the 
Zmaxl- or HBM-Sepharose column are then added to an HBM-Zmaxl -sepharose column. 
Proteins that flow through without sticking are proteins that bind to Zmaxl but not to HBM. 
15 Alternatively, protems that bind to the HBM protein and not to the Zmaxl protein can be 
isolated by reversing the order in which the columns are used. Similar columns can be 
prepared for use in assessing Upid regulation in Uver and o«ier tissues and cells involved in 
lipid regulation and or metabolism. 

XXL Method of Use: Transformation-Associated Recombination (TAR) Cloning 
20 Essential for the identification of novel allelic variants of Zmaxl is the ability to 

examine the sequence of both copies of the gene in an individual. To acQomphsh this, two 
"hooks," or regions of significant similarity, are identified within the genomic sequence such 
that they flank the portion of DNA that is to be cloned. Most preferably, the first of these 
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hooks is derived from sequences 5' to the first exon of interest and the second is derived from 
sequences 3' to the last exon of interest. These two "hooks" are cloned into a bacteriaVyeast 
shuttle vector such as that described by Larionov et al., Proc. Natl. Acad. Sci. USA, 94:7384- 
7387(1997). Other similar vector systans may also be used. To recover the entire genomic 
5 copy of the Zmaxl gene, the plasmid containing the two "hooks" is linearized with a 

restriction endonuclease or is produced by another method such as PGR. This linear DNA 
fragment is introduced into yeast cells along with human genomic DNA. Typically, the yeast 
Saccharomyces cerevisiae is used as a host cell, although chicken host cells can be used as 
wen (Larionov et al. Genet. Eng. (NY). 21:37-55 (1999). During and after the process of 
10 transformation, the endogenous host cell converts the linear plasmid to a circle by a 

recombination event whereby the region of the human genomic DNA homologous to the 
"hooks" is inserted into the plasmid. This plasmid can be recovered and analyzed by methods 
v/eD known to one skilled in the art. Obviously, the specificity for this reaction requires the 
host cell machinCTy to recognize sequences similar to the "hooks" present in the hnear 
15 fragmait. However, 100% sequoice identity is not required, as shown by Kouprma et al.. 
Genomics, 53(l):21-28 (October 1998), where the author describes using degenerate repeated 
sequences common in the human genome to recover fragments of human DNA from a 
rodent/human hybrid cell line. 

hi another example, only one "hook" is required, as described by Larionov et al., 
20 Proc. Natl. Acad. Sci. USA, 95(8):4469-74 (April 1998). For this type of experiment, tenned 
"radial TAR cloning," the other region of sequence similarity to drive the recombination is 
derived from a repeated sequence from the genome. In this way, regions of DNA adjacent to 
the Zmaxl gene coding region can be recovered and examined for alterations that may affect 
fimction. 
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XXn. Methods of Use: Genomic Screening 

The nse of polymorphic genetic markers linked to the HBM gene or to Zmaxl is very 
useful in predicting susceptibility to osteoporosis or other bone diseases. Polymorphic 
genetic markers hnked to the HBM gene or the Zmaxl gene also can be used to predict 
5 susceptibility to arteriosclerosis or atherosclerosis and conditions related thereto. Koller et 
al, Amer. J. Bone Mm. Res., 13:1903-1908 (1998) have demonstrated that the use of 
polymorphic genetic markers is useful for linkage analysis. Similarly, the identification of 
polymorphic genetic markers within the HBM gene will allow the identification of specific 
alleUc variants that are in linkage disequilibrium with other genetic lesions that affect bone 
10 development. Using the DNA sequence from Ihe BACs, a dinucleotide CAn repeat was 
identified and two unique PGR primers that will ampHfy the genomic DNA contaihing this 

repeat were designed, as shown below: 

B200E21C16_L: GAGAGGCTATATCCCTGGGC (SEQ ID NO:38) 
B200E21C16_R: ACAGCACGTGTITAAAGGGG (SEQ ID NO:39) 
15 and used in the genetic mapping study. 

This method has been used successfiilly by oUiers skUled in the art (e.g., Sheffield et 
al.. Genet., 4:1837-1844 (1995); LeBlanc-Straceski et al, Qenomics, 19:341-9 (1994);- Chen 
etal.,Genomics,25:l-S(1995)). Useofthesereagents with populations or individuals will 
predict their risk for osteoporosis. Similarly, single nucleotide polymorphisms (SNPs). such 
20 as those shown in Table 4 above, can be used as well to predict risk for developing bone 
diseases or resistance to osteoporosis in the case of the HBM gene. It is also contemplated 
that single nucleotide polymorphisms (SNPs) such as those described above, may be used to 
predict the risk in a subject for developing arteriosclerosis and atherosclerosis and related 
conditions. 
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XXm. Methods of Use: Modulators of Tissue Calcification 

The calcification of tissues in the human body is weU documented. Towler et al., J. 
Biol, aiem., 273:30427-34 (1998) demonstrated that several proteins known to regulate 
calcification of the developing skull in' a model system are expressed in calcified aorta. The 
5 expression of Msx2, a gene transcribed in osteoprogenitor cells, in calcified vascular tissue 
indicates that genes which are important in bone development are involved in calcification of 
other tissues. Treatment with HBMprotein, agonists or antagonists is likely to ameUorate 
calcification (such as the vasculature, dentin and bone of the skull visera) due to its 
demonstrated effect on bone mineral density. In experimental. systems where tissue 
10 calcification is demonstrated, the over-expression or repression of Zmaxl activity pennits the 
identification of molecules that are directly regulated by the Zmaxl gene. Tliese genes are 
potential targets for ther^eutics aimed at modulating tissue calcification. For example, an 
animal, such as lie LDLR -/-, mouse is fed a high fat diet and is observed to demonstrate 
expression of markers of tissue calcification, including Zmaxl. These «^im.ic are then 
15 treated with antibodies to Zmaxl or HBM protein, antisense oligonucleotides directed against 
Zmaxl orHBMcDNA,orwith confounds known to bind the Zmaxl or HBM protein or its 
binding partner or Hgand. RNA or proteins are extracted .fio^i the vascular tissue and the 
relative expression levels of the genes expressed in the tissue are determined by methods well 
known in the art Genes that are regulated in the tissue are potential therapeutic^targets for 
20 pharmaceutical development as modulators of tissue calcification. 

The nucleic acids, proteins, peptides, amino acids, small molecules or other 
• phaimaceuticaUy useful compounds of the present invention that are to be given to an 
individual may be administered m the form of a composition with aphannaceuticaUy 
acceptable canier, excipient or diluent, which are weU known in the art The individual may " 
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be a mammal or a bird, preferably a human, a rat, a mouse or bird. Such compositions may 
be administered to an individual in a phannaceutically effective amount. The amount 
administered will vary depending on the condition being treated and the patient being treated. 
The compositions may be administered alone or in combination with other treatments. 

5 XXrV. Pharmaceutical Compositions 

The invention also contemplates phannaceutical compositions comprising a lipid 
mediating agent which modulates HBM and/or Zmaxl activity in combination with a 
Upoprotein modulating agent (e.g.. blofibrate. gemfibrozil, nicotinic acid, cholestyramine, 
cholestipol, lovastatin, simvastatin, pravastain, probucol, premarin or estradiol.)- Liprotein 
10 modulating agents can include compounds or compositions which modulate ie.g., up-regulate 
or down-regulate) LDL, VLDL, HDL or IDL levels. 

The Hpid mediating agent, which modulates HBM and/or Zmaxl activity, can include 
proteins, monoclonal antibodies or fragments thereof, chemicals, and mimetics. One 
contemplated pharmaceutical composition can comprise the monoclonal antibody and a 
15 pharmaceutically acceptable carrier. For the purposes of the present invention, a 

"pharmaceutically acceptable earner" can be any of the standard earners well known in the 
art. For example, suitable carriers can include phosphate buffered saline solutions, emulsions 
such as oil/water emulsions, and various types of wetting agents. Other earners can also 
include sterile solutions, tablets, coated tablets, and capsules. Typically, such carriers can 
20 also contain excipients such as starch, milk, sugar, types of clay, gelatin, stearic acid, or salts 
thereof magnesium or calcium sterate. talc, vegetable fats or oils, gums, glycerols, or o&er 
known excipients. Such carriers-can also include flavors and color additives, preservatives. 
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or other ingredients. Compositions comprising such carriers are formxilated by well known 
.conventional means. See Remington's Pharmaceutical Science (15th ed. 1980). 

For diagnostic purposes, the antibodies and recombinant binding proteins can be 
either labeled or unlabeled. Typically, diagnostic assays entail detecting flie formation of a 
5 complex through the binding of the monoclonal antibody or recombrnant binding protein to a 
HBM protein or Zmaxl protein. When unlabeled, the antibodies and recombinant binding 
proteins find use in agglutination assays. In addition, unlabeled antibodies can be used in 
combination with other labeled antibodies (second antibodies) that are specifically reactive 
with the monoclonal antibody or recombinant binding protein, such as antibodies specific for 
10 immunoglobulin. Alternatively, the monoclonal antibodies and recombinant binding proteins 
can be directly labeled. A wide variety of labels can be employed, such as radionuclides 
(e.g., ^^Tc, ^^^In, ^^I and ^^^I), fluorescers, enzymes, enzjme substrates, enzyme cofactors, 
en2yme inhibitors, ligands (particularly haptens), etc. Numerous types of immunoassays are 
well known in the art 

15 Commonly, the monoclonal antibodies and recombiaant binding proteins of the 

present invention are used in fluorescent assays, where the subject antibodies or recombinant 
binding proteins are conjugated to a fluorescent molecule, such as fluorescein isofliiocyanate 
(FTTC). 

The examples provided below are not meant to Umit the invention in any way, but 
20 . serve to provide preferred embodiments for the invention. 

EXAMPLES 

The present invention is described by reference to the following Examples, which are 
offered by way of illustration and are not intended to limit the invention in any manner. 
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Standard techniques weU known in the art or the techniques specifically described below 
• were utilized. 

Exam ple 1 

The propositus was referred by her physicians to the Creighton Osteoporosis Center 
5 forevaluationofwhatappearedtobeunusuaUydensebones. She was 18 years old and came 
to medical attention two years previous because of back pain, which was precipitated by an 
auto accident in which the car in which she was riding as a passenger was struck &om behind. 
Her only injniy was soft tissue injury to her lower back that was manifested by pain and 
muscle tenderness. There was no evidence of fracture or subluxation on radiographs. IHe 
10 pain lasted for two years, although she was able to attend school full time. By.the time she 
was seen in l3ae Center, the pain was nearly resolved and she was back to her usual activities 
as a high school student Physical exam revealed a nomial healthy young woman standing 66 
inches and weighing 128 pounds. Radiographs of the entire skeleton revealed dense looking 
bones with thick cortices. All bones of the skeleton were involved. Most importantly, the 
15 shapes of all the bones were entirely normal. Tlxe '^inal BMC was 94.48 grams inLl-4. and 
the spinal HMD was 1 .667 gm/cm^ in Ll-4. HMD was S.ej standard deviations (SD) above 
■ peak skeletal mass for women. These were measured by DXA udng a Hologic 2000~. Her 
mother was then scanned and a lumbar spinal BMC of 58.05 grams andBMD of 1.500 
gm/cm^ were found. Her mother's values place her 4.12 SD above peak mass and 4.98 SD 
20 above her peers. Hermother was 51 years old, stood 65 inches and weighed 140 pounds. 
Her mother was in excellent health with no history of musculoskeletal or other symptoms. 
Her father's lumbar BMC was 75.33 grams and his BMD was 1.118 gm/cm^ mse values 
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place him 0.25 SD above peak bone mass for males. He was in good health, stood 72 inches 
tall, and weighed 187 pounds. 

These clinical data suggested that the propositus inherited a trait from her mother, 
which resulted in very high bone mass, but an otherwise normal skeleton, and attention was 
focused on the maternal kindred. In U.S. Patent No. 5,691,153, twenty- two of these 
members had measurement of bone mass by DXA. In one case, the maternal grandfather of 
the propositus, was deceased, however, medical records, antemortem skeletal radiographs and 
a gall bladder specimen embedded in parafSn for DNA genotyping were obtained. His 
radiographs showed obvious extreme density of all of the bones avaflable for examination 
including the femur and the spine, and he was included among the affected members. In this 
invention, the pedigree has been expanded to include 37 informative individuals. These 
additions are a significant improvement over Ihe original kinship (Johnson et al. Am. J. Hum. 
Genet., 60:1326-1332 (1997)) because, among the fourteen individuals added since the 
original study, two individuals harbor key crossovers. X-linkage is ruled out by the presence 
of male-to-male transmission from individual 12 to 14 and 15, 



Example 1 

The present invention describes DNA sequences derived from two BAC clones from 
the HBM gene region, as evident in Table 6 below, which is an assembly of these clones. 
Clone b200e21-h (ATCC No. 980812; SEQ ID NOS: 10-11) was deposited at the American 
Type Culture Collection (ATCC), 10801 University Blvd., Manassas, VA 201 10-2209 
U.S.A, on December 30, 1997. Clone b527dl2-h (ATCC No. 980720; SEQ ID NOS: 5-9) 
was deposited at the American Type Culture Collection (ATCQ, 10801 University Blvd., 
Manassas, VA 201 10-2209 USA., on October 2, 1998. These sequences are unique reagents 



-124- 



wo 01/92891 



PCTAJSOl/16946 



that can be used by one skiUed in the art to identify DNA probes for the Zmaxl gene, PGR 
primers to amplify the gene, nucleotide polymorphisms in the Zmaxl gene, or regulatory 



elements of the Zmaxl gene. 



TABLE 6 



10 



Contig 


ATCCNb. 


SEQ ID NO. 


Length 


b527dl2-h__contig302G 


980720 


5 


3096 


b527dl2-h coiitig366G 


980720 . 


6 


26928 


b527dl2-h contig307G 


980720 


.7 


29430 


b527dl2-li contig308G 


980720 


8 


33769 


b527dl2-h_contig309G 


980720 


9 


72049 


b200e21-h_contigl 


980812 


10 


8705 


b200e21-li contig4 


980812 


11 


66933 



The disclosure of each of the patents, patent appHcations and pubUcations cited in the 
specification is hereby incorporated by reference herem in its entirety. 
15 Although the invention has been set forth in detail, one skilled in the art will 

recognize that nmnen>us changes and modifications can bejnade, and that such changes and 
■' modifications may be made without departing fi;om the spirit and scope of the invention. 

Example 3 

Since Zmaxl has similarity to the LDL receptor family of genes, it may be involved 
20 in Hpid metaboUsm. However, others have reported that lipid profile variables did not show 



significant association with bone mass 



and could not be used as indicators for bone mineral 



density (ZabagUa et al, "An exploratory study of association between Upid profile and bone 
mineral density in menopausal women in a Campinas reference hospital," Cad. Saude 
PubUcalA: 779-86 (1998)). Zmaxl may be normally involved in regulating bone density by 
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depositing calcium during bone remodeling. Hie HBM mutation may result in increased 
deposition thus conferring denser bone structure. Interestingly; atherosclerotic plaques 
contain calcified material and express a variety of genes involved in bone differentiation. 
To test whether the HBM gene was iuvolved in lipid regulation, biochemical tests 

5 were performed to measure serum level of various lipid containing molecules or precursors in 
affected and unaffected HBM family members to test whether the HBM mutation in the 
Zmaxl gene effects hpid metabolism. Table 7 shows the results of testing eight HBM 

. individuals and seven unaffected individuals. Wilcoxon rank-sum tests (non-parametric 
equivalent of a T-test) were performed to assess whether levels of biochemical markers fi^m 
10 affected HBM individuals deviated from unaffected individuals. The data obtained were. 

analyzed separately by gender, as well as by combining values from males and females, when 
appropriate. 

Standard diagnostic protocols were used to determine the concentration (mg/dL) with • 
tiiglycerides, cholesterol, high density Hpoprptein (HDL), low density Upoprotein (LDL), 
15 veiy low density lipoprotein (VLDL). ^oHpoprotein A-1 (APO A-1). apolipoproteinB (APO 
B). and lipoprotein a (LIPOa). For such procedures, see for example, F. W. Hemming, LiPm 
Analysis (Bios Scientific Pub. 1996) and J. M. Ordovas, LiPOPRO-raiN Protocols 
(Humana Press Inc., 1997). The genotype for apolipoprotein E (APO E) was also reported. 
There are three common alleles (e.g., E2, E3 and E4). The affected and unaffected HBM 
20 family members are heterozygous or homozygous for the alleles. 

The results obtained were statistically significant: . (1) Triglyceride levels were 
generaUy lower in affected individuals than in unaffected individuals, aod (2) very low 
density Upoprotein (VLDL) levels were generally lower in affected individuals than in 
unaffected individuals. Additionally, the foUowing conq>arisons approached statistical 
25 significance (p=0.06): (1) high density Hpoprolem (HDL) levels were higher ia affected 
males than in unaffected males, and (2) the ratio of low density lipoprotein (LDL) to high 
density Hpoprotein (HDL) was generally higher in affected males tiian in unaffected males. 

In Table 7, "ARUP" is ARUP Laboratories, 500 Chipeta Way, Salt Lake City, UT 
84108 where one of tiie studies was performed "SJH" refers to the second center which 
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perfoimed these studies, Creighton Medical Laboratories, 28th & Burt, Dental-Rm 306, 
Omaha, NE 68178. APO-Al, APO-B aad LIPO-a are reported in mg.dL. Total serum levels 

also are in mg/dL. 

All cited patents and publications referred to in this application are herein 
5 incorporated by reference in their entirety. 
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CI.AIMS 

What is claimed is: 

1 . A method of identifying a molecule involved in lipid regulation comprising 
identifying a molecule that binds to, or that inhibits binding of a molecule to. HBM or 
Zmaxl. 

2. The method of claim 2, wherein said molecule is a protein. 

3. The method ofclaim 2, further comprising producmg an antibody to the 
protein. 

4. A method for identifying a protein involved in lipid regulation comprising 
identifying a protein that has an expression level that is different in a first host comprising the 
Zmaxl gene when compared to a second host comprising the HBM gene. 

5 . The method of claim 4, wherdn the host is an animal. 

6. A method for identification of a candidate molecule involved in Hpid 

regulation comprising: ^. 

(A) identifying amolecule that binds to. or that inhibits binding of a molecule to. 
the nucleic acid sequence of SEQ ID NO: 1 or a Zmaxl nucleic acid comprising a 

polymorphism of Table 4; 

(B) identifying a molecule that binds to, or that inhibits binding of a molecule to, 

the nucleic acid sequence of SEQ ID NO: 2; and 

-(C) comparing the extent of binding, or the ejctent of inhibition of binding, of the 
molecule to each nucleic acid sequence, wherein the molecule that binds, or inhibits binding, 
more or less to the nucleic acid sequence of SEQ ID NO: 2 or the nucleic acid sequence of 
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SEQIDNO: 1 oraZmaxl nucleic acid comprising a polymorphism of Table 4 is the 
candidate molecule. 

7. The method of claim 6, wherein the candidate molecule is a protein, an mKNA 
or an antisense nucleic acid. 

8. A method for testing a substance as a therapeutic agent for modulating lipid 
levels comprising administering a nucleic acid comprising SEQ ID NO: 2 or a nucleic acid 
sequence with an HBM polymorphism to a subject, and assessing whether lipid levels are 
modulated- 

9. The method of claim 8, wherein the subject is an anim al and the aniTn al is 
selected from the group consisting of: Uvestock, primates, humans, canines, felines, rodents, 
birds, reptiles, fish, and amphibians. 

10. A method for testing a substance as a therapeutic agent for modulating Kpid 
levels comprismg administering a protein comprising SEQ ID NO: 4 or a Zmaxl protem 
comprising a polymorphism of Table 4 to a subject, and assessing whether lipid levels are 
modulated. 

11. . A method of pharmaceutical development for treating lipid-mediated disorders 
comprising identifying a molecule that bmds to the amino acid sequence of SEQ ID NO: 4 or 
to a Zmaxl protein comprising a polymorphism of Table 4. 

12. The method of claim 1 1, wherein the molecule inhibits or enhances the 
function of the ammo acid. 

13. A method of pharmaceutical development for treatment of lipid-mediated 
disorders comprising: 
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(A) constructing a first host that contains the Zmaxl gene or protein; 

(B) constructing a second host that contains the HBM gene or protein; 

(C) analyzing a difference between the first host and the second host; and 

(D) identifying a molecule that, when added to the first host, causes fee first host 
to exhibit a characteristic feature of the second host. 

14. The method of claim 13, wherein the host is a cell-free extract, a ceU or an 

animal. 

1 5. The method of claim 13, wherdn the difference is a surrogate marker. 

16. A method of regulating Upid levels in a host comprising administering the 
amino acid sequence comprising SEQ ID NO: 4 to a somatic cell or to a germ-line cell of a 
host suffering from a Upid-mediated disorder. 

17. The method of claim 16, wherein the host is Uvestock, primates, humans, 
canines, fehnes, rodents, birds, reptiles, fish, or amphibians. 

19. A method for treating or preventing a Upid-mediated disorder in an animal 
comprising transferring a nucleic acid sequence comprising SEQ ID NO: 2 or a Zmaxl 
nucleic acid comprising a polymorphism of Table 4 into a somatic cell or a germ-line cell 
of an animal suffering firom a lipid-mediated disorder. 

20. The method of claim 19. wherein tiie animal is livestock, primates, humans, 
canines, felines, yodents, burds, reptiles, fish, or amphibians. 

21 . A inethod of tireating or preventing arteriosclerosis or an arteriosclerosis- 
associated condition comprising administering an amino acid sequence comprising SEQ ID 
NO: 4 to a patient in need thereof. 
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22. The method of claim 2 1 , wherein the patient is livestock, primates, humans, 
canines, felines, rodents, birds, reptiles, fish, or amphibians. 

23. The method of claim 21, wherein the amino acid sequence administered to a 
patient in need thereof comprises the extraceUnlar domain of the amino acid sequence 
comprising SEQ ID NO: 4. 

24. The method of claim 21, wh^ein the amino acid sequence administered to a 
patient in need thereof comprises the intracellular domain of the amino acid sequence 
comprising SEQ ID NO: 4. 

25. A method for treating or preventing a lipid-mediated disorders conaprising 
administering a molecule that biuds to a nucleic acid sequence comprising SEQ ID NO: 2 or 
a Zmaxl nucleic acid comprising a polymorphism of Table 4 to a patient in need thereof. 

26. The method of claim 25, wherein the patient is livestock, primates, humans, 
cardnes, felines, rodents, birds, reptiles, fish, or ampliibians. 

27. A method for treating or preventing lipid-mediated disorders comprising 
administering an antibody to a patient in need thereof wherein the antibody is to the amino 
acid sequence comprising SEQ ID NO: 4, . " 

28. A method for diagnostic screening for a'genetic predisposition to 
arteriosclerosis or an arteriosclerosis associated condition or a lipid-mediated disorder 
comprising screening a sample from a patient with a nucleotide sequence derived fi"om the 
genomic or cDNA nucleic acid sequence of HBM. 
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29. The method of claim 28, wherein the screening involves, performing a 
haplotype analysis using the nucleic acid sequence comprising SEQ ID NO: 2 and 
detemiining whether the subject contains the Zmaxl gene or lacks an HBM polymorphism. 

30. A diagnostic assay for determining a predisposition for a lipid-mediated 
disorders comprising an antibody to the HBM protein and an antibody to the Zmaxl protein. 

31. A method of expressing the HBM protein in tissue comprising constructing an 
expression vector comprising a promoter that directs expression in tissue operably hnked to 
SEQ ID NO:2 and the tissue in which the HBM protein is expressed is a lipid regulating cell 
or a cell involved in hpid metabolism. 

32. The method of claim 3 1, wherein the tissue is liver. 

33. Themethodofclaim31,whereinthepromoterthatdirectsexpressionintissue 
is an osteocalcin promoto: or an AML-3 promoter. 

34. A method of modulating lipid levels in a subject by administering an HBM 
protein or a Zmaxl protein comprising a polymorphism of Table 4. 



35. 



The method of claun 34, wherein the HBM protein comprises SEQ ID NO: 4. 



36. Hie method of claim34, wherein the hpid modulated is selected from the 
group consisting of: VLDL, LDL, IDL, HDL. LIPOa, APO A-l. APO B and APO E. 

37. . AmethodofmodulatingMpidlevelsinasubjectbyadministeringanagent 
which regulates HBM or Zmaxl activity. 
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38. The method of claim 37, wherein the hpid modulated is selected from the 
group consisting of: VLDL, LDL, IDL, HDL, LIPOa, APO A-1, APO B and APO E. 

39. The method of claim 37, wherein the regulation of HBM or Zmaxl activity is 
modulates gene transcription, protein translation or Zmaxl or HBM protein binding to its 
cognate target thereby regulating lipid levels. 

40. A composition for treating a lipid-mediated condition comprising an agent that 
modulates lipid levels by regulating Zmaxl or HBM activity and a lipoprotein modulating 
agent with a pharmaceutically acceptable carrier. 

41 . The composition of claim 40, wherein the lipoprotein modulating agent is 
blofibrate, gemfibrozil, nicotinic acid, cholestyrandne, cholestipol, lovastatin, simvastatin, 
pravastatin, probucol, premarin or estradiol. 

42. The composition of claim 40, wherein the lipoprotein modulating agent 
modulates LDL levels. 

43. The composition of claim 42, wherein the lipoprotein modulating agent is 
selected from the group consisting of bile acid binding resins, HMG-CoA reductase inhibitors 
and estrogens. 

44. A method of treating a subject suffering from a lipid-mediated condition 
comprising the step of administering the composition of claim 40. 

45. The method of claim 44, wherein the lipid-mediated condition is 
atherosclerosis, arteriosclerosis, or a disease associated with atherosclerosis or 
arteriosclerosis. 
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46. A combination therapy for treating a subject suffering from a Upid-mediated 
disease or condition comprising administering to a subject an agent which regulates HBM or 
Zmaxl and an agent which regulates a lipoprotein. 

47. The combination therapy of claim 46, wherein the agent regulating hpoprotein 
concentrations is blofibrate, gemfibrozil, nicotinic acid, cholestyramine, cholestipol, 
lovastatin, simvastatin, pravastain, probucol, premarin or estiradiol. 

48. The method of claim 46, wherein the lipid-mediated disease is atherosclerosis, 
arteriosclerosis, an atherosclerosis associated condition or an arteriosclerosis associated . 
condition. 
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Exon 1 

gcccgtccggccgccggacaacaiggaggcagcgccgcccgggccgcc 

Exon 2 Coordinates: 527dl2_Contig308G 30944-30549 

gccccacagCCTCGCCGCTCCTGCTATTTGCCAACCGCCGGGACGTACGGCT 
GGTGGACGCCGGCGGAGTCAAGCTGGAGTCCACCATCGTGGTCAGCGG 
SfJ^^^f^'^^^^^^^^^^^'T^G^^CCAGTTTTCCAAGGGAGC^^^ 

tactggacagacgtgagcgaggaggccatcaagcagacctacctgaacc 

AGACGGGGGCCGCCGTGCAGAACGTGGTCATCTCCGGCCTGGTCTCTCr 

cgacggcctcgcctgcgactgggtgggcaagaagctgtactggacgga 

CTCAGAGACCAACCGCATCGAGGTGGCCAACCTCAATGGCACATCCCGG 
... 9408 nt ... 

Exon 3 Coordinates: 527dl2_Contig308G 21141-20945 

ccccgtcacagGTACATGTACTGGACAGACTGGGGTGAGACGCCCCGGATTGA 

GCGGGCAGGGATGGATGGCAGCACCCGGAAGATCATTGTGGACTCGGA 

CATTTACTGGCCCAATGGACTGACCATCGACCTGGAGGAGCAGAAGCTr 

S^G^SS^'SS^'''^^^^^'^^^^^^ 
... 6094nt... 

Exon 4 Coordinates: 527dl2_Contlg308G 15047-14850 

tccctgactgcagGCAGAAGGTGGTGGAGGGCAGCCTGACGCACCCCTTCGCrr 
TGACGCTCTCCGGGGACACTCTGTACTGGACAGACTGGCAGACCCGCTC 
CATCCATGCCTGCAACAAGCGCACTGGGGGGAAGAGGAAGGAGATCC^^ 

lGCCT??CT^^aSc^f^^^^^^ 
... 1827 nt... 

Exon 5 Coordinates: 527dl2_Contig308G 13220-13088 

tttctcagTCCACACTCGCTGTGAGGAGGACAATGGCGGCTGCTCCCACCTGT 
GCCTGCTGTCCCCAAGCGAGCCTTTCTACACATGCGCCTGCCCCACGGG 
TGTGCAGCTGCAGGACAACGGCAGGACGTGTAAGGCAGgtgaggcgSgSTcg 

FIG, 3A 
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, 20923 nt 



Evon 6 Coordinates: 527dl2 Contig309G 7705-8100 

Saggatctcgctggaca^ 

GACGACATCCGGCACGCCATTGCCATCGACTACGACCCGCTAGAGGGCT 

ATGTCTACTGGACAGATGACGAGGTGCGGGCCATCCGCAGGGC^^ 

GGACGGGTCTGGGGCGCAGACGCTGGTCAACACCGAGATCAACGACCC 

cgItcgcatcgcggtcgactgggtggcccgaaacctctactggaccgac 

ACGGGCACGGACCGCATCGAGGTGACGCGCCTCAACGGCACCTCCCGC^ 

agatcctcgtgtcggaggacctggacgagccccgagccat^ 

CCCCGTGATGGGgtaagacgggc 
3211 nt 




13445 nt 



Evon 8 Coordinates: 527dl2 Contig309G 24927-25143 
!;XtciGTGATC^^ 

GACAAOTCCCGCACATTITCGGGTTCACGCTGCTGGGGGACTTCATCT 
ACTGGACTGACTGGCAGCGCCGCAGCATCGAGCGGGTGCACAAGG^^ 
GGCCAGCCGGGACGTCATCATTGACCAGCTGCCCGACCTGATGGGGCTC 
AAAGCTGTGAATGTGGCCAAGGTCGTCGgtgagtccggggggtc 



..2826 nt 



Exon 9 Coordinates: 527dl2_Contig309G 27969-28256 ^ _ _^ ^ ^ _ 

ettcecttccagGAACCAACCCGTGTGCGGACAGGAACGGGGGGTGCAGCCACC 

TGTGCTTCTTCA^^ 

GGAGCTGCTGAGTGACATGAAGACCTGCATCGTGCCTGAGGCCTO^ 
GTCrrCACCAGCAGAGCCGCCATCCACAGGATCTCCCTCGAGACCAATA 
ACAACGACGTGGCCATCCCGCTCACGGGCGTCAAGGAGGCCTCAGCCCT 
GGACTTTGATGTGTCCAACAACCACATCTACTGGACAGACGTCAGCCTG 

A ArJortaacstcrffffC 



AAGgtagcgtgggc 
3102 



FIG. 3B 
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Exon 10 Coordinates: 527dl2_Contig309G 31358-31582 

cctgctgccagACCATCAGCCGCGCCTTCATGAACGGGAGCTCGGTGGAGCAC 
GTGGTGGAGTTTGGCCTTGACTACCCCGAGGGCATGGCCGTTG^^^ 

tgggcaagaacctctactgggccgacactgggaccaacagaa?cgSgt 

GGCGCGGCTGGACGGGCAGTTCCGGCAAGTCCTCGTCTGG^^ 
GGACAACCCGAGGTCGCTGGCCCTGGATCCCACCAAGGGgTaSStt^ctS 
1297 nt 

Exon 11 Coordinates: 527dl2_Contig309G 32879-33064 

gtgccttccagCTACATCTACTGGACCGAGTGGGGCGGCAAGCCGAGGATCGT 

gcgggccttcatggacgggaccaactgcatgacgctggiSg^^ 

???S?S?^''^^^^^^^^^CC^TTGACTACGCTGACCAGC^^^ 
GGACCGACCTGGACACCAACATGATCGAGTCGTCCAACATGCTGGgtgaggg 

2069 nt 

Exon 12 Coordinates: 527dl2_Contig309G 35133-35454 

gtgttcatgcagGTCAGGAGCGGGTCGTGATTGCCGACGATCTCCCGCACCCGT 
GCACAGCATTGAGCGGGCCGACAAGACTAGCGGCCGGAACCGCACCCTC 

atccagggccacctggacttcgtgatggacatcctggtgAccactot 

CCCGCCAGGATGGCCTCAATGACTGTATGCACAACAACGGGCACT^^ 

gcagctgtgccttgccatccccggcggccaccgctgc^ct^^ 

CACTACACCCTGGACCCCAGCAGCCGCAACTGCAGCCgtaagtgccte^ 
^006 nt 



Exon 13 Coordinates: 527dl2_Contig309G 37460-37659 

gcctcctctaCGCCCACCACCTTCTTGCTGTTCAGCCAGAAATCTGCCATCAGT 
CGGATGATCCCGGACGACCAGCACAGCCCGGATCTCATCCTGcScTGC 
^IS5i^f^?''''^^^^T^^^^CATCGACTATGACCCA^^^ 

^A^CCA^^IS~^ 



.6965 nt. 



FIG. 3C 
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Exon 14 Coordinates: 527dl2_Contig309G 44624-44832 , ^ , ^ . 

cm^cttacagCCCTITGTTTTGACCTCTCTGAGC^^^ 

GCCATGGGGGTGGTGCTGCGTGGGGACCGCGACAAGCCCAGGGCCATC 
GTCGTCAACGCGGAGCGAGGgtaggaggccaac 



.1404 nt. 



Fvon 15 Coordinates: 527dl2 Contig309G 46236-46427 

TCGAACGCGCAGCCCTGGACGGCACCGAGCGCGAGGTCCTCTTCACCAC 
CGGCCTCATCCGCCeTGTGGCCCTGGTGGTGGACAACACACTGGGCAAG 

ctgttctgggWcgcggacctg^^^ 

CAGgtacgcgccccgg 
686 nt 

Exon 16 Coordinates: 527dl2_Contig309G 47113-47322 . .^rrrrrArr 

ggctgcttgcagGGGCCAACCGCCTGACCCTGGAGGACGCCAACATCG^^ 
rTCTCGGCCTGACCATCCTTGGCAAGCATCTCTACTGGATCGACCGCCA 
rSGCAGSGATCGAGCGTGTGGAGAAGACCACCGGGGACAAGCGGAC 

??G?fTCclGGGCCG^^^ 
GAAGTCAGCCTGGAGGAGTTCTgtacgtgggggc 

3884 nt 

T7vnn 17 roordinates: 527dl2 Contig309G 51206-51331 

Sc^ato^Agggtg^^ 

CTCGTGCTCCTGCAGAACCTGCTGACCTGTGGAGgtaggtgtgacctaggtgc 
....3905 nt 

Exon 18 Coordinates: 527dl2_Contig309G 55236-55472 

rs=ffi2?G^^^^^^^^^ 

5cCCGAGTGCGATGACC^^^ 

^?:CGCCCAGTTCCCCTGCGCGCGGGGTCAGTGTGTGGACCTGCGCCTG^ 
GCTGCGACGGCGAGGCAGACTGTCAGGACCGCTCAGACGAGGTGGACT 

riTCZ A. r Ocrteaceccctcc 



GTGACGgtgaggccctcc 
3052 nt 



FIG. 3D 



SUBSTITUTE SHEET (RULE 26) 



wo 01/92891 



PCT/USOl/16946 



9/29 



Exon 19 Coordinates: 527dl2_Contig309G 58524-58634 

tctccttgcagCCATCTGCCTGCCCAACCAGTTCCGGTGTGCGAGCGGCCAGTG 

TGTCCTCATCAAACAGCAGTGCGACTCCTTCCCCGACTGTATCGACGGCT 
CCGACGAGCTCATGTGTGgtgagccagctt 

1448 nt 



Exon 20 Coordinates: 527dl2_Contig309G 60082-60319 

gtttgtctctggcagAAATCACCAAGCCGCCCTCAGACGACAGCCCGGCCCACAGC 
AGTGCCATCGGGCCCGTCATTGGCATCATCCTCtCTCTCTTCGTCAT^ GG 
TGGTGTCTATTTTGTGTGCCAGCGCGTGGTGTGCCAGCGCTATGCG^JG 
GCCAACGGGCCCTTCCCGCACGAGTATGTCAGCGGGACCCCGCACGTGC 

CCCTCAATTTCATAGCCCCGGGCGGTTCCCAGCATGGCCCCTTCACAGgta 

aggagcctgagatatggaa 



....1095 nt 



Exon 21 Coordinates: 527dl2_Contig309G 61414-61552 

cttccctgccagGCATCGCATGCGGAAAGTCCATGATGAGCTCCGTGAGCCTGA 

TGGGGGGCCGGGGCGGGGTGCCCCTCTACGACCGGAACCACGTCACAG 

GGGCCTCGTCCAGCAGCTCGTCCAGCACGAAGGCCACGCTGTACCCGCC 
Ggtgaggggcggg 



6513 nt 

Exon 22 Coordinates: 527dl2_Contig309G 68065-68162 

ttggctctcctcagATCCTGAACCCGCCGCCCTCCCCGGCCACGGACCCCTCCCT 

GTACAACATGGACATGTTCTACTCTTCAAACATTCCGGCCACTGCGAGAC 
CGTACAGgtaggacatcccctgcag 



.2273 nt, 



FIG. 3E 
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ExoB 23 coordinates: 527dl2_Contig309G 70^^^^^ 




^^^^^^^^^^^^^ 

FIG. 3F 
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1 5 10 

ctg ctg ctg ctg ctg ctg ctg gcg ctg tgc ggc tgc ccg gcc ccc gcc 157 
Leu Leu Leu Leu Leu Leu Leu Ala Leu Cys Gly Cys Pro Ala Pro Ala 
.. 15 20 25 

gcg gcc teg ccg etc ctg eta ttt gcc aac cgc egg gac gta egg ctg 205 
Ala Ala Ser Pro Leu Leu Leu Phe Ala Asn Arg Arg Asp Val Arg Leu 
30 35 40 45 

gtg gac gcc ggc gga gtc aag ctg gag tec ace ate gtg gte age gge • 253 
Val Asp Ala Gly Gly Val Lys Leu Glu Ser Thx He Val Val Ser Gly 

50 55 60 

ctg gag gat gcg gee gea gtg gac tte eag ttt tee aag gga gcc gtg 30 1 
Leu Glu Asp Ala Ala Ala Val Asp Phe Gin Phe Ser Lys Gly Ala Val 

65 70 75 

tac tgg aca gac gtg age gag gag gcc ate aag eag aec tac ctg aac 349 
Tyr Trp Thr Asp Val Ser Glu Glu Ala He Lys Ghi Thr Tyr Leu Asn 

80 85 90 

cag acg ggg gcc gcc gtg eag aac gtg gte ate tee gge ctg gtc tct 397 
Gin Thr Gly Ala Ala Val Gin Asn Val Val He Ser Gly Leu Val Ser 

95 100 105 

ccc gac ggc etc gcc tgc gac tgg gtg ggc aag aag ctg tac tgg aeg 445 
Pro Asp Gly Leu Ala Cys Asp Trp Val Gly Lys Lys Leu Tyr Trp Thr 
110 115 120 125 

gac tea gag aec aac cgc ate gag gtg gee aac etc aat ggc aca tec 493 
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SEQUENCE LISTING 

< 110> John P. Carulli et al. 

< 120 > REGULATING LIPID LEVELS VIA THE ZMAXl or HBM GENE 
<130> 032796-019 

< 150 > Unassigned 

< 151 > 2000-05-26 
<150> US 09/543.771 

< 151 > 2000-04-05 
<150> US 09/544,398 

< 151 > 2000-04-05 
<160> 62 

<210> 1 
<211> 5120 

r 

<212> DNA 
<213> Homo S25)iens 

<400> 1 



actaaagcgc cgccgccgcg ccatggagcc cgagtgagcg cggcgcgggc ccgtccggcc 
gccggacaac atg gag gca gcg ccg ccc ggg ccg ccg tgg ccg ctg ctg 109 
Met GlTi Ala Ala Pro Pro Gly Pro Pro Trp Pro Leu Leu 



60 
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Asp Ser Glu Thi Asn Arg He Glu Val Ala Asn Leu Asn Gly Tbi Ser 

130 135 140 

cggaaggtgctcttctggcaggaccttgaccagccgagggccatcgcc " 541 

Arg Lys Val Leu Phe Trp Gin Asp Leu Asp Gin Pro Arg Ala He Ala 

145 150 155 

ttg gac ccc get cac ggg tac atg tac tgg aca gac tgg ggt gag acg 589 
Leu Asp Pro Ala His Gly Tyr Met Tyr Trp Thr Asp Trp Gly Glu Thr 

160 165 170 

ccc egg att gag egg gca ggg atg gat ggc age ace egg aag ate att 637 
Pro Arg He Glu Arg Ala Gly Met Asp Gly Ser Thr Arg Lys He lie 

175 180 185 

gtg gac teg gac att tac tgg eee aat gga etg ace ate gac etg gag 685 
Val Asp Ser Asp He Tyr Trp Pro Asn Gly Leu Thr He Asp Leu Glu 
190 195 200 205 

gag cag aag etc tac tgg get gae gee aag etc age tte ate eae cgt 733 
Glu Gin Lys Leu Tyr Trp Ala Asp Ala Lys Leu Ser Phe He His Arg 

210 215 220 

gee aac etg gac ggc teg tte egg cag aag gtg gtg gag gge age etg 781 
Ala Asn Leu Asp Gly Ser Phe Arg Gin Lys Val Val Glu Gly Ser Leu 

225 230 235 

acg cac ccc tte gee etg acg etc tec ggg gae act etg tac tgg aca 829 
Thr His Pro Phe Ala Leu Thr Leu Ser Gly Asp Thr Leu Tyr Trp Thr 
240 245 250 
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gac tgg cag acc cgc tec ate cat gcc tgc aac aag cgc act ggg ggg 877 
Asp Trp Gin Thr Arg Ser lie His Ala Cys Asn Lys Arg Thr Gly Gly 

255 260 265 

aag agg aag gag ate ctg agt gcc etc tac tea ccc atg gac ate cag 925 
Lys Arg Lys Glu He Leu Ser Ala Leu Tyr Ser Pro Met Asp He Gin 
270 275 280 285 

gtg ctg age cag gag egg cag cet ttc ttc cac act cgc tgt gag gag 973 
Val Leu Ser Gin Glu Arg Gin Pro Phe Phe His Thr Arg Cys Glu Glu 

290 295 300 

gac aat ggc ggc tgc tec cac ctg tgc ctg ctg tec cca age gag cet 1021 
Asp Asn Gly Gly Cys Ser His Leu Cys Leu Leu Ser Pro Ser Glu Pro 

305 310 315 

ttc tac aca tgc gcc tgc ccc acg ggt gtg cag ctg cag gac aac ggc 1069 
Phe Tyr Thr Cys Ala Cys Pro Thr Gly Val Gin Leu Ghi Asp Asn Gly 

320 325 330 

agg acg tgt aag gca gga gcc gag gag gtg ctg ctg ctg gcc egg egg 11 17 
Arg Thr Cys Lys Ala Gly Ala Glu Glu Val Leu Leu Leu Ala Arg Arg 

335 340 345 

acg gac eta egg agg ate teg ctg gac acg ceg gac ttc acc gac ate 1165 
Thr Asp Leu Arg Arg lie Ser Leu Asp Thr Pro Asp Phe Thr Asp He 
350 355 360 365 

gtg ctg cag gtg gac gac ate egg cac gcc att gcc ate gac tac gac 1213 
Val Leu Ghi Val Asp Asp Be Arg His Ala He Ala He Asp Tyr Asp 



PCTAJSOl/16946 

WO 01/92891 5 



370 375 380 

ccg eta gag ggc .at gtc .ac tgg aca gat gac gag gtg egg gee ate 1261 
P.0 Glu Gly Tyr Val Tyr Trp Tbr Asp Asp G.u Val Axg Ala Be 

385 390 395 

cgc agg gcg <^ eg gac ggg ^ glE gcg cag acg eg gtc aac acc 1309 
Arg Axg Ala Tyr Leu Asp G.y Se. Gly Ala Gta m 1- Val A.n Tta 

400 405 «0 

^ ate aae gae cee gat gge ate geg gte gae tgg gtg gee ega aae 1357 
0>u ne Asn ASP Pro Asp Gly Be Ala Val Asp Ttp Val Ala Axg Asn 



415 420 425 

ctetaetggaeegaeaegggeaeggaeegcategaggtgacgegeetc 1405 

Tyr Trp Tl>r Asp Tta Gly T^ Asp Arg De Glu Val T.. Arg Lea 
430 435 440 445 

gge aee tee ege aag ate etg gtg teg gag gae etg gae gag eee 1453 



Asn Gly Tin Set 



Arg Lys Be Leu Val Set Gto Asp Leu Asp Glu Pro 



450 455 460 

ega gee ate gea etg cae eee gtg atg gge ete atg tae tgg aea gae 1501 
Arg Ala ne Ala Leu His Pro Val Met ay Uu Met IVr Trp Tta ASP 

465 470 475 

tgg gga gag aae ee. aaa ate gag tg. gee aae ttg gat ggg eag gag 1549 
Gly Glu Asu Pro Lys Be Glu Cys Ala Asn Leu Asp Gly Gin Glu 

480 485 490 

egg egt gtg etg gte aat gee tee ete ggg tgg eee aae gge etg gee 1597 
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Arg Arg Val Leu Val Asn Ala Ser Leu Gly Trp Pro Asn Gly Leu Ala 

495 500 505 

ctg gac ctg cag gag ggg aag etc tac tgg gga gac gcc aag aca gac 1645 
Leu Asp Leu Gin Glu Gly Lys Leu Tyr Tip Gly Asp Ala Lys Thr Asp 
510 515 520 525 

aag ate gag gtg ate aat gtt gat ggg acg aag agg egg acc cte etg 1693 
Lys lie Glu Val lie Asn Val Asp Gly Thr Lys Arg Arg Thr Leu Leu 

530 535 540 

gag gac aag etc ccg eac att ttc ggg ttc acg ctg ctg ggg gac ttc 1741 
Glu Asp Lys Leu Pro His He Phe Gly Phe Thr Leu Leu Gly Asp Phe 

545 550 555 

ate tac tgg act gac tgg cag cgc cgc age ate gag egg gtg cac aag 1789 
lie Tyr Tip Thr Asp Tip Gin Arg Arg Ser De Glu Arg Val His Lys 

560 565 570 

gte aag gcc age egg gac gtc ate att gac cag ctg ecc gac ctg atg 1837 
Val Lys Ala Ser Arg Asp Val De Be Asp Gin Leu Pro Asp Leu Met 

575 580 585 

ggg etc aaa get gtg aat gtg gcc aag gtc gtc gga acc aac ccg tgt 1885 
Gly Leu Lys Ala Val Asn Val Ala Lys Val Val Gly Thr Asn Pro Cys 
590 595 600 605 

gcg gac agg aac ggg ggg tgc age cac ctg tgc ttc ttc aca ecc cac 1933 
Ala Asp Arg Asn Gly Gly Cys Ser His Leu Cys Phe Phe Thr Pro His 
610 615 620 
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gca acc egg tgt ggc tgc ccc ate ggc ctg gag ctg etg agt gac atg 1981 
Ala Thr Arg Cys Gly Cys Pro He Gly Leu Glu Leu Leu Ser Asp Met 

625 630 635 

aag acc tgc ate gtg cet gag gee ttc ttg gtc ttc acc age aga gee 2029 
Lys Thi Cys He Val Pro Glu Ala Phe Leu Val Phe Thr Ser Arg Ala 

640 645 650 

gcc ate cac agg ate tee etc gag acc aat aae aac gac gtg gcc ate 2077 
Ala He His Arg He Ser Leu Glu Thr Asn Asn Asn Asp Val Ala He 

655 660 665 

ccg etc acg ggc gtc aag gag gee tea gee etg gac ttt gat gtg tec 2125 
Pro Leu Thr Gly Val Lys Glu Ala Ser Ala Leu Asp Phe Asp Val Ser 
670 675 680 685 

aac aac cac ate tac tgg aca gac gtc age ctg aag acc ate age egc 2173 
Asn Asn His lie Tyr Trp Thr Asp Val Ser Leu Lys Thr He Ser Arg 

690 695 700 

gcc ttc atg aae ggg age teg gtg gag cac gtg gtg gag ttt ggc ett 2221 
Ala Phe Met Asn Gly Ser Ser Val Glu His Val Val Glu Phe Gly Leu 

705 710 715 

gac tac ccc gag ggc atg gcc gtt gae tgg atg ggc aag aac etc tac 2269 
Asp Tyr Pro Glu Gly Met Ala Val Asp Trp Met Gly Lys Asn Leu Tyr 

720 725 730 

tgg gcc gac act ggg acc aac aga ate gaa gtg geg egg ctg gac ggg 23 17 
Trp Ala Asp Thr Gly Thr Asn Arg He Glu Val Ala Arg Leu Asp Gly 
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735 740 745 

cag ttc egg caa gtc etc gtg tgg agg gac ttg gae aac ccg agg teg 2365 
Gin Phe Arg Gin Val Leu Val Tip Arg Asp Leu Asp Asn Pro Arg Ser 
750 755 760 765 

ctg gcc ctg gat ccc acc aag ggc tac ate tac tgg acc gag tgg ggc 2413 
Leu Ala Leu Asp Pro Thr Lys Gly Tyr lie Tyr Trp Thr Glu Tip Gly 

770 775 780 

ggc aag ccg agg ate gtg egg gcc ttc atg gac ggg acc aac tgc atg 2461 
Gly Lys Pro Arg lie Val Arg Ala Phe Met Asp Gly Thr Asn Cys Met 

785 790 795 

aeg ctg gtg gac aag gtg ggc egg gee aac gae etc ace att gae tae 2509 
Thr Leu Val Asp Lys Val Gly Arg Ala Asn Asp Leu Thr lie Asp Tyr 

800 805 810 

get gac cag cgc etc tac tgg acc gac ctg gae ace aac atg ate gag 2557 
Ala Asp Gin Arg Leu Tyr Tip Thr Asp Leu Asp Thr Asn Met He Glu 

815 820 825 

teg tec aac atg ctg ggt cag gag egg gtc gtg att gcc gac gat etc 2605 
Ser Ser Asn Met Leu Gly Gin Glu Arg Val Val He Ala Asp Asp Leu 
830 835 840 845 

ccg cac ccg ttc ggt ctg aeg cag tac age gat tat ate tae tgg aca 2653 
Pro His Pro Phe Gly Leu Thr Glu Tyr Ser Asp Tyr lie Tyr Trp Thr 

850 855 860 

gac tgg aat ctg cac age att gag egg gee gae aag act age ggc egg 2701 
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Asp Trp Asn Leu His Ser Ue Glu Arg Ala Asp Lys Thr Set Gly Axg 

865 870 875 

aac cgc acc etc ate cag ggc cac ctg gac ttc gtg atg gac ate etg 2749 
Asn Arg Thr Leu He Gin Gly His Leu Asp Phe Val Met Asp He Leu 

880 885 . 890 . 

gtgttecaeteetcccgccaggatggceteaatgaetgtatgcacaac 2797 

Val Phe His Ser Ser Arg Gin Asp Gly Leu Asn Asp Cys Met His Asn 

895 900 905 

aac ggg cag tgt ggg cag ctg tgc ctt gcc ate eee gge ggc cac cgc 2845 
Asn Gly Gin Cys Gly GlnLeu Cys Leu Ala He Pro Gly Gly His Arg 
910 915 920 925 

tgc ggc tgc gcc tea cac tac acc etg gac eee age age cgc aac tgc 2893 
Cys Gly Cys Ala Ser His Tyr Ita Leu Asp Pro Ser Ser Arg Asn Cys 

930 935 940 

age ecg cec ace ace ttc ttg ctg tte age cag aaa tct gcc ate agt 2941 
Set Pro Pro Thr Thr Phe Leu Leu Phe Ser Ghi Lys Ser Ala He Ser 

945 950 955 

egg atg ate ecg gac gac cag cac age ecg gat etc ate etg eee ctg 2989 
Axg Met ne Pro Asp Asp Ghi His Ser Pro Asp Leu He Leu Pro Leu 

960 965 970 

cat gga etg agg aac gtc aaa gee ate gac Ut gac cea etg gac aag 3037 
His Gly Leu Axg Asn Val Lys Ala He Asp Tyr Asp Pro Leu Asp Lys 
975 980 985 
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ttc ate tac tgg gtg gat ggg cgc cag aac ate aag ega gee aag gac 3085 
Phe He Tyi Tip Val Asp Gly Arg Gin Asn He Lys Arg Ala Lys Asp 
590 995 1000 1005 

gac ggg ace eag ecc ttt gtt ttg ace tct ctg age eaa ggc caa aae 3133 
Asp Gly Thr Gin Pro Phe Val Leu Thr Set Leu Ser Gin Gly Gin Asn 

1010 1015 1020 

cca gac agg cag ecc cac gac etc age ate gac ate tae age egg aca 3 181 
Pro Asp Arg Gin Pro His Asp Leu Ser He Asp He Tyr Ser Arg Thr 

1025 1030 1035 

ctg ttc tgg aeg tgc gag gee acc aat ace ate aae gtc eac agg ctg 3229 
Leu Phe Tip Thr Cys Glu Ala Thr Asn Thr He Asn Val His Arg Leu 

1040 1045 1050 

age ggg gaa gee atg ggg gtg gtg ctg cgt ggg gac cgc gac aag ecc 3277 
Ser Gly Glu Ala Met Gly Val Val Leu Axg Gly Asp Arg Asp Lys Pro 

1055 1060 1065 

agg gee ate gtc gtc aac gcg gag ega ggg tac ctg tae ttc ace aac 3325 
Arg Ala He Val Val Asn Ala Glu Arg Gly Tyr Leu Tyr Phe Thr Asn 
1070 1075 1080 1085 

atg cag gac egg gca gee aag ate gaa cgc gca gcc ctg gac ggc acc 3373 
Met Ghi Asp Arg Ala Ala Lys De Glu Arg Ala Ala Leu Asp Gly Thr 

1090 1095 1100 

gag cgc gag gtc etc ttc ace acc ggc etc ate cgc cct gtg gcc ctg 3421 
Glu Arg Glu Val Leu Phe Thr Thr Gly Leu He Arg Pro Val Ala Leu 
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1105 1110 1115 

gtg gtg gac aac aca ctg ggc aag ctg ttc tgg gtg gac gcg gac ctg 3469 
Val Val Asp Asn Thr Leu Gly Lys Leu Phe Trp Val Asp Ala Asp Leu 

1120 1125 1130 

aag cgc att gag age tgt gac ctg tea ggg gee aac cgc ctg ace ctg 3517 
Lys Arg He Glu Ser Cys Asp Leu Ser Gly Ala Asn Arg Leu Thi Leu 

1135 1140 1145 

gag gac gcc aac ate gtg cag cct ctg ggc ctg acc ate ctt ggc aag 3565 
Glu Asp Ala Asn He Val Gin Pro Leu Gly Leu Thr He Leu Gly Lys 
1150 1155 1160 1165 

cat etc tac tgg ate gac cgc cag cag cag atg ate gag cgt gtg gag 3613 
His Leu Tyr Tip He Asp Arg Gin Gin Gin Met lie Glu Arg Val Glu 

1170 1175 1180 

aag acc acc ggg gac aag egg act cgc ate cag ggc cgt gtc gcc cac 3661 
Lys Thr Thr Gly Asp Lys Arg Thr Arg He Gin Gly Arg Val Ala His 

1185 1190 - 1195 

etc act ggc ate cat gca gtg gag gaa gtc age ctg gag gag ttc tea 3709 
Leu Thr Gly He His Ala Val Glu Glu Val Ser Leu Glu Glu Phe Ser 

1200 1205 1210 

gcc cac cca tgt gcc cgt gac aat ggt ggc tgc tec cac ate tgt att 3757 
Ala His Pro Cys Ala Arg Asp Asn Gly Gly Cys Ser His He Cys He 

1215 1220 1225 

gcc aag ggt gat ggg aca cca egg tgc tea tgc cca gtc cac etc gtg 3805 
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Ala Lys Gly Asp Gly Thr Pro Arg Cys Ser Cys Pro Val His Leu Val 
1230 1235 1240 1245 

etc ctg cag aac ctg ctg acc tgt gga gag ccg ccc acc tgc tec ecg 3853 
Leu Leu Gin Asn Leu Leu Thr Cys Gly Glu Pro Pro Thr Cys Ser Pro 

1250 1255 1260 

gac cag ttt gca tgt gcc aca ggg gag ate gac tgt ate ecc ggg gee 3901 
Asp Gin Phe Ala Cys Ala Thr Gly Glu Be Asp Cys He Pro Gly Ala 

1265 1270 1275 

tgg cgc tgt gac ggc ttt ccc gag tgc gat gac cag age gac gag gag 3949 
Trp Arg Cys Asp Gly Phe Pro Glu Cys Asp Asp Gin Ser Asp Glu Glu 

1280 1285 1290 

ggc tgc ccc gtg tgc tec gee gee cag ttc ccc igc gcg egg ggt cag 3997 
Gly Cys Pro Val Cys Ser Ala Ala Gin Phe Pro Cys Ala Arg Gly Gin 

1295 1300 1305 

tgt gtg gac ctg cgc ctg cgc tgc gac ggc gag gca gac tgt cag gac 4045 
Cys Val Asp Leu Arg Leu Arg Cys Asp Gly Glu Ala Asp Cys Gin Asp 
1310 1315 1320 1325 

cgc tea gac gag gtg gac tgt gac gcc ate tgc ctg ccc aac cag ttc 4093 
Arg Ser Asp Glu Val Asp Cys Asp Ala lie Cys Leu Pro Asn Ghi Phe 

1330 1335 1340 

egg tgt gcg age ggc cag tgt gtc etc ate aaa cag cag tgc gac tee 4141 
Arg Cys Ala Ser Gly Gin Cys Val Leu He Lys Gin Gin Cys Asp Ser 
1345 1350 1355 
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ttc CO gac tgt ^tc gac ggc tec gae gag etc atg tgt gaa ate aco 4189 
Ph. Pro Asp Cys nc Gly S« Asp Glu Met Cys Glu n. Tbx 

1360. 1365 1370 
aag eeg eee tea gae gae age ceg gee eae age agt gee ate ggg eee 4237 
LysPro ProSer Asp Asp Set Pro AlaHis Se, Ser Alalle Gly Pro 

1375 1380 

gtc att ggc ate ate ete tet cte tte gte atg ggt ggt gtc tat ttt 4285 
Val ne Gly He He Leu Ser Leu Phe Val Met Gly Gly Val Tyr Phe 
1390 1395 1400 1405 

gtg tgc eag cgc gtg gtg tgc cag cgc tat geg ggg gcc aac ggg ccc 4333 
Val Cys Gla Axg Val Val Cys Gin Axg Tyr Ala Gly Ala Asu Gly Pro 

1410 1415 1420 

tte eeg eac gag tat gtc age ggg ace eeg eae gtg eee cte aat tte 4381 
Phe Pro His Glu I^r Val Ser Gly Thr Pro His Val Pro Leu Asn Phe 

1425 1430 1435 

ata gee eeg ggc ggt tec cag cat ggc eee tte aea ggc ate gea tgc 4429 
ne Ala Pro Gly Gly Ser Gin His Gly Pro Phe Thr Gly Be Ala Cys 

1440 .1445 1450 

gga aag tee atg atg age tec gtg age ctg atg ggg ggc egg ggc ggg 4477 
Jly Lys Ser Met Met Ser Ser Val Ser Leu Met Gly Gly Arg Gly Gly 

1455 1460 1465 

gtg eee etc tac gae egg aac eae gte aea ggg gcc teg tec age age 4525 
val Pro Leu Tyr Asp Arg Asn His Val Thr Gly Ala Ser Ser Ser Ser 
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1470 1475 1480 1485 

teg tec age acg aag gcc acg ctg tac ccg ccg ate ctg aac cog ccg 4573 

Ser Ser Ser Thr Lys Ala Thr Leu Tyr Pro Pro De Leu Asn Pro Pro 

1490 1495 1500 

ccc tec ccg gcc acg gac ccc tec ctg tac aac atg gac atg ttc tac 4621 
Pro Ser Pro Ala Thr Asp Pro Ser Leu Tyr Asn Met Asp Met Phe Tyr 

1505 1510 1515 

tct tea aac att ccg gcc act gcg aga ccg tac agg ccc tac ate att 4669 
Ser Ser Asn He Pro Ala Thr Ala Arg Pro Tyr Arg Pro Tyr De He 

1520 1525 1530 

cga gga atg gcg ccc ccg acg acg ccc tgc age acc gac gtg tgt gac 4717 
Arg Gly Met Ala Pro Pro Thr Thr Pro Cys Ser Thr Asp Val Cys Asp 
1535 1540 1545 

age gac tac age gcc age cgctgg aag gcc age aag tac tac ctg gat 4765 
Ser Asp Tyr Ser Ala Ser Arg Trp Lys Ala Ser Lys Tyr Tyr Leu Asp 
1550 1555 1560 1565 

ttg aac teg gac tea gac ccc tat cca ccc cca ccc acg ccc cac age 4813 
Leu Asn Ser Asp Ser Asp Pro Tyr Pro Pro Pro Pro Thr Pro His Ser 

1570 1575 1580 

cag tac ctg teg gcg gag gac age tgc ccg ccc teg ccc gcc acc gag 4861 
Gin Tyr Leu Ser Ala Glu Asp Ser Cys Pro Pro Ser Pro Ala Thr Glu 

1585 1590 1595 

agg age tac ttc cat etc ttc ccg ccc cct ccg tec ccc tgc acg gac 4909 
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Arg Ser Tyr Plie His Leu Plie Pro Pro Pro Pro Ser Pro Cys Thr Asp 

1600 1605 1610 

tea tec tgacctcggc cgggccactc tggcttctct gtgcccctgt aaatagtttt 4965 
Ser Ser 
1615 

aaatatgaac aaagaaaaaa atatatttta tgatttaaaa aataaatata attgggattt 5025 
taaaaacatg agaaatgtga actgtgatgg ggtgggeagg gctgggagaa ctttgtacag 5085 
tggagaaata tttataaact taattttgta aaaca 5 120 

<210> 2 

<211> 5120 

<212> DNA 

< 213 > Homo sapiens 



<400> 2 

actaaagcgc cgccgccgcg ccatggagcc cgagtgagcg cggcgcgggc ccgtccggcc 60 
gecggacaac atg gag gca gcg ccg ccc ggg ccg ccg tgg ccg ctg ctg 109 
Met Glu Ala Ala Pro Pro Gly Pro Pro Trp Pro Leu Leu 
15 io 
ctg ctg ctg ctg ctg ctg ctg gcg ctg tgc ggc tgc ccg gee ccc gcc 157 
Leu Leu Leu Leu Leu Leu Leu Ala Leu Cys Gly Cys Pro Ala Pro Ala 

15 20 25 

gcg gcc teg ccg etc ctg eta ttt gcc aac cgc cgg^ac gta egg ctg 205 
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Ala Ala Ser Pro Leu Leu Leu Phe Ala Asn Arg Arg Asp Val Arg Leu 
30 35 40 45 

gtg gac gcc ggc gga gtc aag ctg gag tec acc ate gtg gte age ggc 253 
Val Asp Ala Gly Gly Val Lys Leu Glu Ser Thr lie Val Val Ser Gly 

50 55 60 

c^ gag gat gcg gcc gca gtg gac tte cag ttt tee aag gga gee gtg 301 
Leu Glu Asp Ala Ala Ala Val Asp Phe Gin Phe Ser Lys Gly Ala Val 

65 70 75 

tac tgg aca gac gtg age gag gag gcc ate aag cag ace tac ctg aac 349 
Tyr Tip Thr Asp Val Ser Glu Glu Ala Jle Lys Gin Thr Tyr Leu Asn 

80 85 90 

cag aeg ggg gcc gcc gtg cag aac gtg gtc ate tec ggc ctg gtc tct 397 
Gin Thr Gly Ala Ala Val Gin Asn Val Val lie Ser Gly Leu Val Ser 

95 100 105 

ecc gac ggc etc gcc tgc gac tgg gtg ggc aag aag ctg tac tgg acg 445 
Pro Asp Gly Leu Ala Cys Asp Tip Val Gly Lys Lys Leu Tyr Trp Thr 
110 115 120 125 

gac tea gag ace aac cge ate gag gtg gee aac etc aat ggc aca tec 493 
Asp Ser Glu Thr Asn Arg Jle Glu Val Ala Asn Leu Asn Gly Thr Ser 

130 135 140 

egg aag gtg etc tte tgg cag gac ctt gae cag ccg agg gcc ate gee 541 
Arg Lys Val Leu Phe Tip Gin Asp Leu Asp Ghi Pro Arg Ala He Ala 
145 150 155 
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ttg gac ccc get cac ggg tac atg tac tgg aca gac tgg gtt gag acg 589 
Leu Asp Pro Ala His Gly Tyr Met Tyr Trp Thr Asp Trp Val Glu Thr 

160 165 170 

ccc egg att gag egg gca ggg atg gat gge age ace egg aag ate att 637 
Pro Arg He Glu Arg Ala Gly Met Asp Gly Set Thr Arg Lys He lie 

175 180 185 

gtg gac teg gac att tac tgg ccc aat gga ctg acc ate gac ctg gag 685 
Val Asp Ser Asp lie Tyr Trp Pro Asn Gly Leu Thr ne Asp Leu Glu 
190 195 200 205 

gag cag aag etc tac tgg get gac gcc aag etc age ttc ate cac cgt 733 
Glu Ghi Lys Leu Tyr Trp Ala Asp Ala Lys Leu Ser Phe He His Arg 

210 215 220 

gee aac ctg gac gge teg ttc egg cag aag gtg gtg gag ggc age ctg 781 
Ala Asn Leu Asp Gly Ser Phe Arg Ghi Lys Val Val Glu Gly Ser Leu 

225 230 235 

acg cac ccc ttc gee ctg acg etc tec ggg gac act ctg tac tgg aca 829 
Thr His Pro Phe Ala Leu Thr Leu Ser Gly Asp Thr Leu Tyr Trp Thr 

240 245 250 

gac tgg cag acc cgc tec ate cat gcc tge aac aag cge act ggg ggg 877 
Asp Tip Gin Thr Arg Ser He His Ala Cys Asn Lys Arg Thr Gly Gly 

255 260 265 

aag agg aag gag ate ctg agt gcc etc tac tea ccc atg gac ate cag 925 
Lys Arg Lys Glu He Leu Ser Ala Leu Tyr Ser Pro Met Asp lie Gin 
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270 275 280 285 

gtg ctg age cag gag egg eag ect ttc tte eac act ego tgt gag gag 973 

Val Leu Ser Gin Glu Arg Gin Pro Phe Phe His Thr Arg Cys Glu Glu 

290 295 300 

gac aat ggc ggc tgc tec cac ctg tgc ctg ctg tec cea age gag ect 1021 
Asp AsD Gly Gly Cys Ser His Leu Cys Leu Leu Ser Pro Ser Glu Pro 

305 310 315 

ttc tac aca tgc gee tgc cec acg ggt gtg cag ctg cag gac aac ggc 1069 
Phe Tyr Thr Cys Ala Cys Pro Thr Gly Val Gin Leu Gin Asp Asn Gly 

320 325 330 

agg acg tgt aag gca gga gee gag gag gtg ctg ctg ctg gee egg egg 1117 
Arg Thr Cys Lys Ala Gly Ala Glu Glu Val Leu Leu Leu Ala Arg Arg 

335 340 345 

acg gac eta egg agg ate teg ctg gac acg ecg gac ttc ace gac ate 1165 
Thr Asp Leu Arg Arg Tie Ser Leu Asp Thi Pro Asp Phe Thr Asp He 
350 355 360 365 

gtg ctg cag gtg gac gac ate egg cac gee att gee ate gac tac gac 1213 
Val Leu Ghi Val Asp Asp He Arg His Ala De Ala lie Asp Tyr Asp 

370 375 380 

ecg eta gag ggc tat gtc tac tgg aca gat gac gag gtg egg gee ate 1261 
Pro Leu Glu Gly Tyr Val Tyr Trp Thr Asp Asp Glu Val Arg Ala Be 

385 390 395 

cgc agg geg tac ctg gae ggg tet ggg gcg cag acg ctg gtc aac acc 1309 
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Arg Arg Ala Tyr Leu Asp Gly Ser Gly Ala Gin Thr Leu Val Asn Thr 

400 405 410 

gag ate aac gac ccc gat ggc ate gcg gtc gac tgg gtg gcc cga aac 1357 
Glu ne Asn Asp Pro Asp Gly fle Ala Val Asp Trp Val Ala Arg Asn 

415 420 425 

etc tac tgg acc gac acg ggc acg gac cgc ate gag gtg acg cgc etc 1405 
Leu Tyr Trp Thr Asp Thr Gly Thr Asp Arg He Glu Val Thr Arg Leu 
430 435 440 445 

aac ggc acc tec cgc aag ate ctg gtg teg gag gac ctg gac gag ccc 1453 
Asn Gly Thr Ser Arg Lys He Leu Val Ser Glu Asp Leu Asp Glu Pro 

450 455 460 

cga gee ate gca ctg cac ccc gtg atg ggc etc atg tac tgg aca gac 1501 
Arg Ala He Ala Leu His Pro Val Met Gly Leu Met Tyr Trp Thr Asp 

465 470 475 

tgg gga gag aac cct aaa ate gag tgt gcc aac ttg gat ggg cag gag 1549 
Trp Gly Glu Asn Pro Lys lie Glu Cys Ala Asn Leu Asp Gly Ghi Glu 

480 485 490 

egg cgt gtg ctg gtc aat gee tec etc ggg tgg ccc aac ggc ctg gee 1597 
Arg Arg Val Leu Val Asn Ala Ser Leu Gly Trp Pro Asn Gly Leu Ala 

495 500 505 

ctg gac ctg cag gag ggg aag etc tac tgg gga gac gcc aag aca gac 1645 
Leu Asp Leu Gin Glu Gly Lys Leu Tyr Trp Gly Asp Ala Lys Thr Asp 
510 515 520 525 
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aag ate gag gtg ate aat gtt gat ggg aeg aag agg egg aec etc ctg 1693 
Lys De Glu Val He Asn Val Asp Gly Thr Lys Arg Arg Thr Leu Leu 

530 535 540 

gag gac aag etc ccg eac att ttc ggg ttc acg ctg ctg ggg gac ttc 1741 
Glu Asp Lys Leu Pro His He Phe Gly Phe Thr Leu Leu Gly Asp Phe 

545 550 555 

ate tac tgg act gac tgg eag ege cgc age ate gag egg gtg cae aag 1789 
He Tyr Trp Thr Asp Tip Gin Arg Arg Ser He Glu Arg Val His Lys 

560 565 570 

gte aag gee age egg gac gtc ate att gac eag ctg ecc gac ctg atg 1837 
Val Lys Ala Ser Arg Asp Val He He Asp Gin Leu Pro Asp Leu Met 

575 580 585 

ggg etc aaa get gtg aat gtg gee aag gte gtc gga ace aac ccg tgt 1885 
Gly Leu Lys Ala Val Asn Val Ala Lys Val Val Gly Thr Asn Pro Cys 
590 595 600 605 

geg gac agg aac ggg ggg tge age cac ctg tgc ttc ttc aca ccc cae 1933 
Ala Asp Arg Asn Gly Gly Cys Ser His Leu Cys Phe Phe Thr Pro His 

610 615 620 

gea ace egg tgt ggc tge ccc ate gge ctg gag ctg ctg agt gac atg 198 1 
Ala Thr Arg Cys Gly Cys Pro He Gly Leu Glu Leu Leu Ser Asp Met 

625 630. 635 

aag ace tgc ate gtg ect gag gee ttc ttg gte ttc aec age aga gee 2029 
Lys Thr Cys He Val Pro Glu Ala Phe Leu Val Phe Thr Ser Arg Ala 



PCTAJSOl/16946 



wo 01/92891 PCT/USOl/16946 

21 



640 645 650 

gcc ate cac agg ate tec etc gag aec aat aac aac gac gtg gee ate 2077 
Ala He His Arg He Ser Leu Glu Thr Asn Asn Asa Asp Val Ala He 

655 660 665 

ccg etc acg ggc gtc aag gag gcc tea gcc ctg gac ttt gat gtg tec 2125 
Pro Leu Thr Gly Val Lys Glu Ala Ser Ala Leu Asp Plie Asp Val Ser 
670 675 680 685 

aac aac cac ate tac tgg aca gac gtc age ctg aag aec ate age egc 2173 
Asn Asn His He Tyr Trp Thr Asp Val Ser Leu Lys Thr He Ser Arg 

690 695 700 

gee tte atg aac ggg age teg gtg gag cac gtg gtg gag ttt ggc ett 2221 
Ala Phe Met Asn Gly Ser Ser Val Glu His Val Val Glu Phe Gly Leu 

705 710 715 

gac tac ccc gag ggc atg gee gtt gac tgg atg ggc aag aac etc tac 2269 
Asp Tyr Pro Glu Gly Met Ala Val Asp Trp Met Gly Lys Asn Leu Tyr 

720 725 730 

tgg gcc gac act ggg acc aac aga ate gaa.gtg gcg egg ctg gac ggg 2317 
Trp Ala Asp Thr Gly Thr Asn Arg De Glu Val Ala Arg Leu Asp Gly 

735 740 745 

cag ttc egg caa gtc etc gtg tgg agg gac ttg gac aac ccg agg teg 2365 
Glu Phe Arg Ghi Val Leu Val Trp Arg Asp Leu Asp Asn Pro Arg Ser 
750 755 760 765 

ctg gcc ctg gat ccc acc aag ggc tac ate tac tgg acc gag tgg ggc 2413 



wo 01/92891 

22 



Leu Ala Leu Asp Pro Thr Lys Gly Tyr lie Tyr Tip Thr Glu Trp Gly 

770 775 780 

ggc aag ccg agg ate gtg egg gee tie atg gae ggg ace aac tge atg 2461 
Gly Lys Pro Arg Be Val Arg Ala Phe Met Asp Gly Thr Asn Cys Met 

785 790 795 

acg ctg gtg gae aag gtg ggc egg gee aac gae etc aec att gae tae 2509 
Thr Leu Val Asp Lys Val Gly Arg Ala Asn Asp Leu Thr lie Asp Tyr 

800 805 810 

get gae eag egc etc tae tgg acc gae etg gae aec aac atg ate gag 2557 
Ala Asp Gin Arg Leu Tyr Trp Thr Asp Leu A^ Thr Asn Met He Glu 

815 820 825 

teg tee aac atg ctg ggt eag gag egg gtc gtg att gee gae gat etc 2605 
Ser Ser Asn Met Leu Gly Gin Glu Arg Val Val He Ala Asp Asp Leu 
830 835 840 845 

ccg cae ccg ttc ggt ctg acg eag tae age gat tat ate tae tgg aca 2653 
Pro His Pro Phe Gly Leu Thr Gin Tyr Ser Asp Tyr He Tyr Trp Thr 

850 855 860 

gae tgg aat etg cac age att gag egg gee gae aag act age ggc egg 2701 
Asp Trp Asn Leu His Ser He Glu Arg Ala Asp Lys Thr Ser Gly Arg 

865 870 875 

aac egc aec etc ate eag ggc cac ctg gae ttc gtg atg gae ate ctg 2749 
Asn Arg Thr Leu He Ghi Gly His Leu Asp Phe Val Met Asp He Leu 
880 885 890 
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gtg ttc cac tec tec cgc cag gat ggc etc aat gac tgt atg cac aac 2797 
Val Phe His Ser Ser Arg Gin Asp Gly Leu Asn Asp Cys Met His Asn 

895 900 905 

aac ggg cag tgt ggg cag ctg tgc ctt gee ate cee ggc ggc cac cgc 2845 
Asn Gly Gin Cys Gly Gin Leu Cys Leu Ala He Pro Gly Gly His Arg. . 
910 915 920 925 

tgc ggc tgc gee tea cac tac ace ctg gac ccc age age cgc aac tgc 2893 
Cys GJy Cys Ala Ser His Tyr Thr Leu Asp Pro Ser Ser Arg Asn Cys 

930 935 940 

age ccg cec ace ace ttc ttg ctg ttc age cag aaa tct gee ate agt 2941 
Ser Pro Pro.Thr Thr Phe Leu Leu Phe Ser Ghi Lys Ser Ala He Ser 

945 950 955 

egg atg ate ccg gac gac cag cac age ccg gat etc ate ctg cec ctg 2989 
Arg Met lie Pro Asp Asp Ghi His Ser Pro Asp Leu De Leu Pro Leu 

960 965 970 

cat gga ctg agg aac gtc aaa gee ate gac tat gac cea ctg gap aag 3037 
His Gly Leu Arg Asn Val Lys Ala He Asp Tyr Asp Pro Leu Asp Lys 

975 980 985 

ttc ate tac tgg gtg gat ggg cgc cag aac ate aag ega gee aag gac 3085 
Phe ne Tyr Trp Val Asp Gly Arg Ghi Asn lie Lys Arg Ala Lys Asp 
990 995 1000 1005 

gac ggg ace cag cec ttt gtt ttg ace tct ctg age caa ggc caa aac 3 133 
Asp Gly Thr Gin Pro Phe Val Leu Thr Ser Leu Ser Gin Gly Ghi Asn 
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1010 . 1015 1020 

cca gac agg cag ccc cac gac etc age ate gae ate tac age egg aca 3181 
Pro Asp Arg Gin Pro His Asp Leu Ser lie Asp lie Tyr Ser Arg Thr 

1025 1030 1035 

ctg ttc tgg acg tgc gag gee ace aat ace ate aac gtc cac agg ctg 3229 
Leu Phe Tip TTir Cys Glu Ala Thr Asn Thr lie Asn Val His Arg Leu 

1040 1045 1050 

age ggg gaa gcc atg ggg gtg gtg ctg cgt ggg gac cgc gac aag ccc 3277 
Ser Gly Glu Ala Met Gly Val Val Leu Arg Gly Asp Arg Asp Lys Pro 

1055 1060 1065 

agg gcc ate gtc gtc aac gcg gag cga ggg tac ctg tac ttc aec aac 3325 
Arg Ala lie Val Val Asn Ala Glu Arg Gly Tyr Leu Tyr Phe Thr Asn 
1070 ' 1075 1080 1085 

atg cag gae egg gea gee aag ate gaa cgc gca gee etg gae gge acc 3373 
Met GlQ Asp Axg Ala' Ala Lys lie Glu Arg Ala Ala Leu Asp Gly Thr 

1090 1095 1100 

gag cge gag gtc cte-ttc ace ace gge etc ate cgc cct gtg gee ctg 3421 
Glu Arg Glu Val Leu Phe Thr Thr Gly Leu He Arg Pro Val Ala Leu 

1105 1110 1115 

gtg gtg gae aac aca etg gge aag etg ttc tgg gtg gac gcg gac ctg 3469 
Val Val Asp Asn Thr Leu Gly Lys Leu Phe Trp Val Asp Ala Asp Leu 

1120 1125 1130 

aag cgc att gag age tgt gae etg tea ggg gcc aac cgc ctg ace ctg 35 17 
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Lys Axg ne Glu Ser Cys Asp Leu Set Gly Ala Asn Arg Leu ^ Leu 
1140 1145 



1135 



3565 



3613 



gag gac gcc aac ate gtg cag cot ctg ggc ctg ace ate ctt ggc aag 
Glu ASP Ala Asn lie Val Gin Pro Leu Gly Leu Thr He Leu Gly Lys 

1150 1155 1160 1165 

cat etc tac tgg ate gac cge cag cag cag atg ate gag cgt gtg gag 

His Leu Tyr Trp He Asp Arg Gin Gin Gin Met He Glu Arg Val Glu 

1170 1175 1180 

aag acc acc ggg gac aag egg act cgc ate cag ggc cgt gtc gcc cac 3661 
Lys Thr Tl^ Gly Asp Lys Arg THr Axg lie Gin Gly Arg val Ala His 

1185 1190 1195 

etc act ggc ate cat gca gtg gag gaa gtc age ctg gag gag ttc tea 3709 
Leu Thr Gly He His Ala Val Glu Glu Val Ser Leu Glu Glu Phe Ser 

1200 1205 1210 

gcc cac eca tgt gcc cgt gac aat ggt ggc tge tec cac ate tgt att 3757 
lla His Pro Cys Ala Arg Asp Asn Gly Gly Cys Ser His He Cys lie 

1215 1220 1225 

gcc aag ggt gat ggg aea cea egg tge tea tge eca gtc cac etc gtg 3805 
Ala Lys Gly Asp Gly Arg Cys Ser Cys Pro Val His Leu Val 

1230 1235 1240 1245 

etc ctg cag aac ctg ctg acc tgt gga gag ccg ccc acc tge tec ccg 
Leu Leu Gin Asn Leu Leu m Cys Gly Glu Pro Pro thr Cys Ser Pro 
1250 1255 1260 



3853 
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gac cag ttt gca tgt gcc aca ggg gag ate gac tgt ate ceo ggg gee 3901 
Asp Gin Phe Ala Cys Ala Thr Gly Glu Be Asp Cys He Pro Gly Ala 

1265 1270 1275 

tgg cgc tgt gae ggc ttt ccc gag tgc gat gae cag age gac gag gag 3949 
Tip Arg Cys Asp Gly Phe Pro Glu Cys Asp Asp Gin Ser Asp Glu Glu 

1280 1285 1290 

ggc tgc ccc gtg tgc tec gcc gcc cag ttc ccc tgc gcg egg ggt cag 3997 
Gly Cys Pro Val Cys Ser Ala Ala Gin Phe Pro Cys Ala Arg Gly Gin 

1295 1300 1305 

tgt gtg gac ctg cgc ctg ege tgc gae ggc gag gca gac tgt cag gac 4045 
Cys Val Asp Leu Arg Leu Arg Cys Asp Gly Glu Ala Asp Cys Gin Asp 
1310 1315 1320 1325 

cgc tea gac gag gtg gac tgt gac gee ate tgc ctg cce aac cag ttc 4093 
Arg Ser Asp Glu Val Asp Cys Asp Ala De Cys Leu Pro Asn Gin Phe 

1330 1335 1340 

egg tgt gcg age ggc cag tgt gtc etc ate aaa cag cag tgc gac tee 4141 
Arg Cys Ala Ser Gly Ghi Cys Val Leu Be Lys Gin Gin Cys Asp Ser 

1345 1350 1355 

ttc cee gac tgt ate gac ggc tec gac gag etc atg tgt gaa ate acc 4189 
Phe Pro Asp Cys De Asp Gly Ser Asp Glu Leu Met Cys Glu lie Thr 

1360 1365 1370 

aag ccg ccc tea gac gac age ccg gcc cac age agt gcc ate ggg ecc 4237 
Lys Pro Pro Ser Asp Asp Ser Pro Ala His Ser Ser Ala He Gly Pro 
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1375 1380 1385 

gtc att ggc ate ate etc tct cte tte gtc atg ggt ggt gte tat ttt 4285 
Val He Gly He lie Leu Ser Leu Phe Val Met Gly Gly Val Tyr Phe 
1390 1395 1400 1405 

gtg tgc cag cgc gtg gtg tgc eag cgc tat gcg ggg gcc aac ggg ccc 4333 
Val Cys Gin Arg Val Val Cys Gin Arg Tyr Ala Gly Ala Asn Gly Pro 

1410 1415 1420 

ttc ecg cac gag tat gte age ggg ace ccg cae gtg cec etc aat tte 4381 
Phe Pro His Glu Tyr Val Ser Gly Thr Pro His Val Pro Leu Asn Phe 

1425 1430 1435 

ata gcc ccg ggc ggt tec cag cat ggc ccc ttc aca ggc ate gca tgc • 4429 
ne Ala Pro Gly Gly Ser Gin His Gly Pro Phe Thi Gly lie Ala Cys 

1440 1445 1450 

gga aag tec atg atg age tee gtg age ctg atg ggg ggc egg ggc ggg 4477 
Gly Lys Ser Met Met Ser Ser Val Ser Leu Met Gly Gly Arg Gly Gly 

1455 1460 1465 

gtg ccc etc tac gac egg aac cae gtc aca ggg gcc teg tec age age 4525 
Val Pro Leu Tyr Asp Arg Asn His Val Thr Gly Ala Ser Ser Ser Ser 
1470 1475 1480 1485 

teg tec age acg aag gcc acg ctg tac ecg ccg ate ctg aac ecg ecg 4573 
Ser Ser Ser Thr Lys Ala Thr Leu Tyr Pro Pro He Leu Asn Pro Pro 

1490 1495 1500 

ccc tec ccg gee acg gac ccc tec ctg tac aac atg gac atg tte tac 4621 
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Pro Ser Pro Ala Thr Asp Pro Ser Leu Tyr Asn Met Asp Met Phe Tyr 

1505 1510 1515 

tct tea aac att ccg gcc act gcg aga ccg tac agg ccc tac ate att 4669 
Ser Ser Asn He Pro Ala Thr Ala Arg Pro Tyr Arg Pro Tyr lie lie 

1520 1525 1530 

cga gga atg gcg ccc ccg acg acg ccc tgc age acc gac gtg tgt.gae 4717 
Arg Gly Met Ala Pro Pro Thr Thr Pro Cys Ser Thr Asp Val Cys Asp 

1535 1540 1545 

age gac tac age gcc age cgc tgg aag gcc age aag tac tac ctg gat 4765 
Ser Asp Tyr Ser Ala Ser Arg Trp Lys Ala Ser Lys Tyr Tyr Leu Asp 
1550 1555 1560 1565 

ttg aac teg gac tea gac ccc tat cea ccc cca ccc acg ccc cac age 4813 
Leu Asn Ser Asp Ser Asp Pro Tyr Pro Pro Pro Pro Thr Pro His Ser 

1570 1575 1580 

cag tac etg teg gcg gag gac age tgc ccg ccc teg cee gee ace gag 4861 
Gin Tyr Leu Ser Ala Glu Asp Ser Cys Pro Pro Ser Pro Ala Thr Glu 

1585 1590 1595 

agg age tac tte cat etc ttc ccg ccc ect ccg tec ccc tgc acg gac 4909 
Arg Ser Tyr Phe His Leu Phe Pro Pro Pro Pro Ser Pro Cys Thr Asp 

1600 1605 1610 

tea tee tgacctcggc cgggecacte tggettetet gtgcccctgt aaatagtttt 4965 
Ser Ser 
1615 
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aaamtgaac aaaga^ atatatttta tgamaaaa aa.aa3.aJa .ttgggam 
taaaaacatgagaaa««aaCg,ga«ggg«ggcagEECgggagaao«gmcag 5083 

5120 

tggagaaata tttataaact taattttgta aaaca 



<21G> 3 
<211> 1615 
<212> PRT 
<213> Homo sapiens 



<400 > 3 

Met Glu Ala Ala Pro Pro Gly Pro Pro Trp Pro I^u I^u Leu Leu Leu 
15 10 15 

Leu Leu Leu Leu Ala Leu Cy s Gly Cys Pro Ala Pro Ala Ala Ala Ser 

20 25 30 

Pro Leu Leu Leu Phe Ala Asn Arg Arg Asp Val Arg Leu Val Asp Ala 

35 40 45 

Gly Gly Val Lys Leu Glu Ser TTnr He Val Val Ser Gly Leu Glu Asp 
55 60 



50 



Ala Ala Ala Val 



Asp Phe Gin Phe Ser Lys Gly Ala Val Tyr Tip Thr 



65 



70 



75 



80 



Glu Ala ne Lys Gin Thr Tyx Leu Asn Ghi Thr Gly 

95 



Asp Val Ser Glu 

85 90 
Ala Ala Val Ghi Asn Val Val fle Ser Gly Leu Val Ser Pro Asp Gly 
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100 105 110 

Leu Ala Cys Asp Tip Val Gly Lys Lys Leu Tyr Tip Thr Asp Ser Glu 

115 120 125 

Thr Asn Arg De Glu Val Ala Asn Leu Asn Gly Thr Ser Arg Lys Val 

130 135 140 

Leu Phe Tip Ghi Asp Leu Asp Ghi Pro Lys Ala He Ala Leu Asp Pro 

150 155 160 

Ala His Gly Tyr Met Tyr Tip Thr Asp Trp Gly Glu Thr Pro Arg He 

165 170 . 175 

Glu Arg Ala Gly Met Asp Gly Ser Thr Arg Lys De He Val Asp Ser 

180 185 190 

Asp He Tyr Tip Pro Asn Gly Leu Thr He Asp Leu Glu Glu Gin Lys 

195 200 205 

Leu Tyr Tip Ala Asp Ala Lys Leu Ser Phe De His Arg Ala Asn Leu 

210 215 220 

Asp Gly Ser Phe Arg Ghi Lys Val Val Glu Gly Ser Leu Thr His Pro 
225 230 235 240 

Phe Ala Leu Thr Leu Ser Gly Asp Thr Leu Tyr Trp Thr Asp Tip Ghi 
245 250 255 

Thr Arg Ser He His Ala Cys Asn Lys Arg Thr Gly Gly Lys Arg Lys 
260 265 270 

Glu ne Leu Ser Ala Leu Tyr Ser Pro Met Asp De GM Val Leu Ser 
275 280 285 
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Gin Glu Arg Gin Pro Phe Phe His Thr Arg Cys Glu Glu Asp Asn Gly 

290 295 300 

Gly Trp Ser His Leu Cys Leu Leu Ser Pro Ser Glu Pro Phe Tyr Thr 

305 310 315 320 

Cys Ala Cys Pro Thr Gly Val Gin Met GlQ Asp Asn Gly Arg Thr Cys 

325 330 335 

Lys Ala Gly Ala Glu Glu Val Leu Leu Leu Ala Arg Arg Thr Asp Leu 

340 345 350 

Arg Arg He Ser Leu Asp Thr Pro Asp Phe Thr Asp lie Val Leu GM 

355 360 365 

Val Asp Asp lie Arg His Ala He Ala He Asp Tyr Asp Pro Leu Glu 

370 375 380 

Gly Tyr Val Tyr Trp Thr Asp Asp Ghi Val Arg Ala He Arg Arg Ala 
385 390 395 400 

Tyr Leu Asp Gly Ser Gly Ala Gin Thr Leu Val Asn Thr Glu He Asn 

405 410 415 

Asp Pro Asp Gly lit Ala Val Asp Trp Val Ala Arg Asn Leu Tyr Trp 

420 425 430 

Thr Asp Thr Gly Thr Asp Arg He Glu Val Thr Arg Leu Asn Gly Thr 

435 440 445 

Ser Arg Lys He Leu Val Ser Glu Asp Leu Asp Glu Pro Arg Ala He 

450 455 460 

Ala Leu His Pro Val Met Gly Leu Met Tyr Trp Thr Asp Trp Gly Glu 
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465 470 475 480 

Asn Pro Lys lie Glu Cys Ala Asn Leu Asp Gly Gin Glu Arg Arg Val 

485 490 495 

Leu Val Asn Ala Ser Leu Gly Trp Pro Asn Gly Leu Ala Leu Asp Leu 

500 505 510 

Gin Glu Gly Lys Leu Tyr Trp Gly Asp Ala Lys Thr Asp Lys De Glu 

515 520 525 

Val He Asn Val Asp Gly Thr Lys Arg Arg Thr Leu Leu Glu Asp Lys 

530 535 540. 

Leu Pro His He Hie Gly Phe Thr Leu Leu Gly Asp Phe lie Trp 
545 550 555 560 

Thr Asp Trp Ghi Arg Arg Ser He Glu Arg Val His Lys Val Lys Ala 

565 570 575 

Ser Arg Asp Val lie He Asp Ghi Leu Pro Asp Leu Met Gly Leu Lys 

580 585 590 

Ala Val Asn Val Ala Lys Val Val Gly Thr Asn Pro Cys Ala Asp Arg 

595 600 605 

Asn Gly Gly Cys Ser His Leu Cys Phe Phe Thr Pro His Ala Thr Arg 

610 615 620 

Cys Gly Cys Pro He Gly Leu Glu Leu Leu Ser Asp Met Lys Thr Cys 
625 630 635 640 

He Val Pro Glu Ala Phe Leu Val Phe Thr Ser Arg Ala Ala lie His 
645 650 655 



PCTAJSOl/16946 
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Arg ne Ser Leu Glu IHr Asu Asn Asn Asp Val Ala He Pro Leu Tbr 

660 665 670 

Gly Val Lys Glu Ala Ser Ala Leu Asp Phe Asp Val Ser Asn Asn His 

675 680 685 

ne Tyr Trp liar Asp Val Ser Leu Lys Asn He Ser Arg Ala Phe Met 

690 695 700 . 

Asn Gly Ser Ser Val Glu His Val Val Glu Phe Gly Leu Asp Tyr Pro 
705 710 715 720 

Glu Gly Met Ala Val Asp Trp Met Gly Lys Asn Leu Tyr Trp Ala Asp 

725 730 735 

Thx Gly ITK Asn Arg ne Glu Val Ala Arg Leu Asp Gly Gbi Phe Axg 

740 745 750 

Gin Val Leu Val Trp Arg Asp Leu Asp Asn Pro Arg Ser I^u Ala Leu 

755 760 765 

Asp Pro Tbr Lys Gly Tyr ne Tyr Trp m Glu Trp Gly Gly Lys Pro 

770 775 780 

Arg ne Val Arg Ala Phe Met Asp Gly Ihr Asn Cys Met Tbx Leu Val 
785 790 795 800 

Asp Lys Val Gly AT g Ala Asn Asp Leu Thr ne Asp Tyr Ala Asp Ghi 

805 810 
Arg Leu Tyr Trp Tl^ Asp Leu Asp Thr Asn Met He Glu Ser Ser Asn 

820 825 830 

Met Leu Gly Gbi Glu Arg V^ Val fle Ala Asp Asp Leu Pro His Pro 
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835 840 845 

Phe Gly Leu Thr Gin Tyr Ser Asp Tyr Tic Tyr Trp Thr Asp Tip Asn 

850 855 860 

Leu His Ser De GIu Arg Ala Asp Lys Thr Ser Gly Arg Asn Arg Thr 
865 870 875 880 

Leu De Gin Gly His Leu Asp Phe Val Met Asp He Leu Val Phe His 

885 890 895 

Ser Ser Arg Gin Asp Gly Leu Asn Asp Cys Met His Asn Asn Gly Gin 

900 905 910 

Cys Gly Gin Leu Cys Leu Ala He Pro.Gly Gly His Arg Cys Gly Cys 

915 920 925 

Ala Ser His Tyr Thr Leu Asp Pro Ser Ser Arg Asn Cys Ser Pro Pro 

930 935 940 

Thr Thr Phe Leu Leu Phe Ser Ghi Lys Ser Ala He Ser Arg Met De 
945 950 955 960 

Pro Asp Asp Ghi His Ser Pro Asp Leu De Leu Pro Leu His Gly Leu 

965 970 975 

Arg Asn Val Lys Ala De Asp Tyr Asp Pro Leu Asp Lys Phe De Tyr 

980 985 990 

Trp Val Asp Gly Arg Gin Asn De Lys Arg Ala Lys Asp Asp Gly Thr 

995 1000 1005 

Ghi Pro Phe Val Leu Thr Ser Leu Ser Ghi Gly Gin Asn Pro Asp Arg 
1010 1015 1020 
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Gin Pro His Asp Leu Ser He Asp lit Tyr Ser Arg Thr Leu Phe Trp 

1025 1030 1035 1040 

Thr Cys Glu Ala Thr Asn Thr He Asn Val His Arg Leu Ser Gly Glu 

• 1045 1050 1055 

Ala Met Gly Val Val Leu Arg Gly Asp Arg Asp Lys Pro Arg Ala He 

1060 1065 1070 

Val Val Asn Ala Glu Arg Gly Tyr Leu Tyr Phe Thr Asn Met Gin Asp 

1075 1080 1085 

Arg Ala Ala Lys He Glu Arg Ala Ala Leu Asp Gly Thr Glu Arg Glu 

\ 1090 1095 1100 

Val Leu Phe Thr Thr Gly Leu He Arg Pro Val Ala Leu Val Val Asp 
1105 1110 1115 1120 

Asn Thr Leu Gly Lys Leu Phe Trp Val Asp Ala Asp Leu Lys Arg He 

1125 1130 1135 

Glu Ser Cys Asp Leu Ser Gly Ala Asn Arg Leu Thr Leu Glu Asp Ala 

1140 1145 1150 

Asn He Val Gin Pro Leu Gly Leu Thr He Leu Gly Lys His Leu Tyr 

1155 1160 1165 

Trp ne Asp Arg Ghi Gki Ghi Met He Glu Arg Val Glu Lys Thr Thr 

1170 1175 1180 

Gly Asp Lys Arg Thr Arg He Ghi Gly Arg Val Ala His Leu Thr Gly 
1185 1190 1195 1200 

Tift His Ala Val Glu Glu Val Ser Leu Glu Glu Phe Ser Ala His Pro 
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1205 1210 1215 

Cys Ala Arg Asp Asn Gly Gly Cys Ser His He Cys De Ala Lys Gly 

1220 1225 1230 

Asp Gly Thr Pro Arg Cys Ser Cys Pro Val His Leu Val Leu Leu Gin 

1235 1240 1245 

Asn Leu Leu Thr Cys Gly Glu Pro Pro Thr C^s Ser Pro Asp Gin Phe 

1250 1255 1260 

Ala Cys Ala Thr Gly Glu He Asp Cys He Pro Gly Ala Trp Arg Cys 
1265 1270 1275 1280 

Asp Gly Phe Pro Glu Cys Asp Asp Gin Ser Asp Glu Glu Gly Cys Pro 

1285 1290 1295 

Val Cys Ser Ala Ala Gin Phe Pro Cys Ala Arg Gly Gin Cys Val Asp 

1300 1305 1310 

Leu Arg Leu Arg Cys Asp Gly Glu Ala Asp Cys GM Asp Arg Ser Asp 

1315 1320 1325 

Glu Val Asp Cys Asp Ala De Cys Leu Pro Asn Ghi Phe Arg Cys Ala 

1330 1335 1340 

Ser Gly Gin Cys Val Leu He Lys Ghi Ghi Cys Asp Ser Phe Pro Asp 
1345 1350 1355 1360 

Cys He Asp Gly Ser Asp Glu Leu Met Cys Glu He Thr Lys Pro Pro 

1365 1370 1375 

Ser Asp Asp Ser Pro Ala His Ser Ser Ala He Gly Pro Val lie Gly 
1380 1385 1390 
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ne ne Leu Ser Leu Phe Val Met Gly Gly Val Tyr Phe Val Cys Gin 

1395 1400 1405 

Arg Val Val Cys Gin Arg Tyr Ala Gly Ala Asn Gly Pro Phe Pro His 

1410 1415 1420 

Glu Tyr Val Ser Gly Thr Pro.His Val Pro Leu Asn Phe He Ala Pro 
1425 1430 1435 1440 

Gly Gly Ser Gin His Gly Pro Phe Thr Gly lie Ala Cys Gly Lys Ser • 

1445 1450 1455 

Met Met Ser Ser Val Ser Leu Met Gly Gly Arg Gly Gly Val Pro Leu 

1460 1465 1470 

Tyr Asp Arg Asn His Val Thr Gly Ala Ser Ser Ser Ser Ser Ser Ser 

1475 1480 1485 

Thr Lys Ala Thr Leu Tyr Pro Pro lie Leu Asn Pro Pro Pro Ser Pro 

1490 1495 1500 

Ala Thr Asp Pro Ser Leu Tyr Asn Met Asp Met Phe Tyr Ser Ser Asn 
1505 1510 1515 1520 

ne Pro Ala Thr Ala Arg Pro Tyr Arg Pro Tyr lie He Arg Gly Met 

1525 1530 1535 

Ala Pro Pro Thr Thr Pro Cys Ser Thr Asp Val Cys Asp Ser Asp Tyr 

1540 1545 1550 

Set Ala Ser Arg Trp Lys Ala Ser Lys Tyr Tyr Leu Asp Leu Asn Ser 

1555 . 1560 1565 

Asp Ser Asp Pro Tyr Pro Pro Pro Pro Thr Pro His Ser Gin Tyr Leu 
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1570 1575 1580 

Ser Ala Glu Asp Ser Cys Pro Pro Ser Pro Ala Thr Glu Arg Ser Tyr 
1585 1590 1595 1600 

Phe His Leu Phe Pro Pro Pro Pro Ser Pro Cys Thr Asp Ser Ser 
1605 1610 1615 



<210> 4 
<211> 1615 
<212> PRT 
<213> Homo sapiens 

<400 > 4 

Met Glu Ala Ala Pro Pro Gly Pro Pro Tip Pro Leu Leu Leu Leu Leu 
1 5 10 15 

Leu Leu Leu Leu Ala Leu Cys Gly Cys Pro Ala Pro Ala Ala Ala Ser 

20 25 30 

Pro Leu Leu Leu Phe Ala Asn Arg Arg Asp Val Arg Leu Val Asp Ala 

35 40 45 

Gly Gly Val Lys Leu Glu Ser Thr lie Val Val Ser Gly Leu Glu Asp 

50 55 60 

Ala Ala Ala Val Asp Phe GUi Phe Ser Lys Gly Ala Val Tyr Trp Thr 
65 70 75 80 
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Asp Val Ser Glu Glu Ala He Lys Gin Thr Tyr Leu Asn Gin Thr Gly 

85 90 95 

Ala Ala Val Gin Asn Val Val lie Ser Gly Leu Val Ser Pro Asp Gly 

100 105 110 

Leu Ala Cys Asp Tip Val Gly Lys Lys Leu Tyr Tip Thr Asp Ser Glu 

115 120 125 

Thr Asn Arg lie Glu Val Ala Asn Leu Asn Gly Thr Ser Arg Lys Val 

130 135 140 

Leu Phe Trp Ghi Asp Leu Asp Gin Pro Lys Ala De Ala Leu Asp Pro 
145 150 155 160 

Ala His Gly Tyr Met Tyr Trp Thr Asp Trp Val Glu Thr Pro Arg He 

165 170 175 

Glu Arg Ala Gly Met Asp Gly Ser Thr Arg Lys He He Val Asp Ser 

180 185 190 

Asp He Tyr Trp Pro Asn Gly Leu Thr De Asp Leu Glu Glu Ghi Lys 

195 200 205 

Leu Tyr Trp Ala Asp Ala Lys Leu Ser Phe De.His Arg Ala Asn Leu 

210 215 220 

Asp Gly Ser Phe Arg Gki Lys Val Val Glu Gly Ser Leu Thr His Pro 
225 230 235 240 

Phe Ala Leu Thr Leu Ser Gly Asp Thr Leu Tyr Trp Thr Asp Tip Ghi 

245 250 255 

Thr Are Ser He His Ala Cys Asn Lys Arg Thr Gly Gly Lys Arg Lys 
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260 265 270 

GIu ne Leu Ser Ala Leu Tyr Ser Pro Met Asp He Gin Val Leu Ser 

275 280 285 

Gin Glu Arg Gin Pro Phe Phe His Thr Arg Cys Glu Glu Asp Asn Gly 

290 295 300 

Gly Trp Ser His Leu Cys Leu Leu Ser Pro Ser Glu Pro Phe Tyr Thr 
305 310 315 320 

Cys Ala Cys Pro Thr Gly Val Gin Met Gin Asp Asn Gly Arg Thr Cys 

325 330 335 

Lys Ala Gly Ala Glu Glu Val Leu Leu Leu Ala Arg Arg Thr Asp Leu 

340 345 350 

Arg Arg lie Ser Leu Asp Thr Pro Asp Phe Thr Asp lie Val Leu Gin 

355 360 365 

Val Asp Asp ne Arg His Ala He Ala lie Asp Tyr Asp Pro Leu Glu 

370 375 380 

Gly Tyr Val Tyr Trp Thr Asp Asp Glu Val Arg Ala lie Arg Arg Ala 
385 390 395 400 

Tyr Leu Asp Gly Ser GLy Ala Ghi Thr Leu Val Asn Thr Glu lie Asn 

405 410 415 

Asp Pro Asp Gly lie Ala Val Asp Trp Val Ala Arg Asn Leu Tyr Trp 

420 425 430 

Thr Asp Thr Gly Thr Asp Arg He Glu Val Thr Arg Leu Asn Gly Thr 
435 440 445 
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Ser Aig Lys He Leu Val Ser Glu Asp Leu Asp Glu Pro Arg Ala He 

450 455 460 

Ala Leu His Pro Val Met Gly Leu Met Tyr Trp Thr Asp Trp Gly Glu 
465 470 475 480 

Asn Pro Lys He Glu Cys Ala Asn Leu Asp Gly Gin Glu Arg Arg Val 

485 490 495 

Leu Val Asn Ala Ser Leu Gly Trp Pro Asn Gly Leu Ala Leu Asp Leu 

500 505 510 

Gin Glu Gly Lys Leu Tyr Trp Gly Asp Ala Lys Thr Asp Lys He Glu 

515 520 525 

Val fle Asn Val Asp Gly Thr Lys Arg Arg Thr Leu Leu Glu Asp Lys 

530 535 540 

Leu Pro His fle Phe Gly Phe Thr Leu Leu Gly Asp Phe He Tyr Trp 
545 550 555 560 

Thr Asp Trp Ghi Arg Arg Ser He Glu Arg Val His Lys Val Lys Ala 

565 570 575 

Ser Arg Asp Val He He Asp Glu Leu Pro Asp Leu Met Gly Leu Lys 

580 585 590 

Ala Val Asn Val Ala Lys Val Val Gly Thr Asn Pro Cys Ala Asp Arg 

595 600 605 

Asn Gly Gly Cys Ser His Leu Cys Phe Phe Thr Pro His Ala Thr Arg 

610 615 620 

Cys Gly Cys Pro He Gly Leu Glu Leu Leu Ser Asp Met Lys Thr Cys 
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625 630 635 640 

ne Val Pro GIu Ala Phe Leu Val Phe Thr Ser Arg Ala Ala De His 

645 650 655 

Arg lie Ser Leu Glu Thr Asn Asn Asn Asp Val Ala lie Pro Leu Thr 

660 665 670 

Gly Val Lys Glu Ala Ser Ala Leu Asp Phe Asp Val Ser Asn Asn His 

675 680 685 

He Tyr Trp Thr Asp Val Ser Leu Lys Asn He Ser Arg Ala Phe Met 

690 695 700 

Asn Gly Ser Ser Val Glu His Val Val Glu Phe Gly Leu Asp Tyr Pro 
705 710 715 720 

Glu Gly Met Ala Val Asp Tip Met Gly Lys Asn Leu Tyr Trp Ala Asp 

725 730 735 

Thr Gly Thr Asn Arg He Glu Val Ala Arg Leu Asp Gly Gin Phe Arg 

740 745 750 

Gin Val Leu Val Tip Arg Asp Leu Asp Asn Pro Arg Ser Leu Ala Leu 

755 760 765 

Asp Pro Thr Lys Gly Tyr Be Tyr Trp Thr Glu Trp Gly Gly Lys Pro 

770 775 780 

Arg He Val Arg Ala Phe Met Asp Gly Thr Asn Cys Met Thr Leu Val 
785 790 795 800 

Asp Lys Val Gly Arg Ala Asn Asp Leu Thr lie Asp Tyr Ala Asp Gin 
805 810 815 
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Arg Leu Tyr Tip Thr Asp Leu Asp Thr Asn Met He Glu Ser Ser Asn 

820 825 830 

Met Leu Gly Gin Glu Arg Val Val He Ala Asp Asp Leu Pro His Pro 

835 840 845 

Phe Gly Leu Thr Gin Tyr Ser Asp Tyr He Tyr Trp Thr Asp Trp Asn 

850 855 860 

Leu His Ser lie Glu Arg Ala Asp Lys Thr Ser Gly Arg Asn Arg Thr 
865 870 875 880 

Leu ne Gin Gly His Leu Asp Phe Val Met Asp He Leu Val Phe His 

885 890 895 

Ser Ser Arg Ghi Asp Gly Leu Asn Asp Cys Met His Asn Asn Gly Ghi 

900 905 910 

Cys Gly Ghi Leu Cys Leu Ala lie Pro Gly Gly His Arg Cys Gly Cys 

915 920 925 

Ala Ser His Tyr Thr Leu Asp Pro Ser Ser Arg Asn Cys Ser Pro Pro 

930 935 940 

Thr Thr Phe Leu Leu Phe Ser Ghi Lys Ser Ala He Ser Arg Met He 
945 950 955 960 

Pro Asp Asp Ghi His Ser Pro Asp Leu He Leu Pro Leu His Gly Leu 

965 970 975 

Arg Asn Val Lys Ala He Asp Tyr Asp Pro Leu Asp Lys Phe He Tyr 

980 985 990 

Trp Val Asp Gly Arg Gin Asn He Lys Arg Ala Lys Asp Asp Gly Thr 



BNSDOCID: <WO ^0192a91A2LL> 



wo 01/92891 

44 



995 1000 1005 

Gin Pro Phe Val Leu Thr Ser Leu Ser Gin Gly Gin Asn Pro Asp Arg 

1010 1015 1020 

Gin Pro His Asp Leu Ser lie Asp De Tyr Ser Arg Thr Leu Phe Trp 
1025 1030 1035 1040 

Thr Cys Glu Ala Thr Asn Thr He Asn Val His Arg Leu Ser Gly Glu 

1045 1050 1055 

Ala Met Gly Val Val Leu Arg Gly Asp Arg Asp Lys Pro Arg Ala De 

1060 1065 1070 

Val Val Asn Ala Glu Arg Gly Tyr Leu Tyr Phe Thr Asn Met Gin Asp 

1075 1080 1085 

Arg Ala Ala Lys De Glu Arg Ala Ala Leu Asp Gly Thr Glu Arg Glu 

1090 1095 1100 

Val Leu Phe Thr Thr Gly Leu He Arg Pro Val Ala Leu Val Val Asp 
1105 1110 1115 1120 

Asn Thr Leu Gly Lys Leu Phe Trp Val Asp Ala Asp Leu Lys Arg lie 

1125 1130 1135 

Glu Ser Cys Asp Leu Ser Gly Ala Asn Arg Leu Thr Leu Glu Asp Ala 

1140 1145 1150 

Asn He Val Ghi Pro Leu Gly Leu Thr He Leu Gly Lys His Leu Tyr 

1155 1160 1165 

Tip He Asp Arg GM Gin Ghi Met He Glu Arg Val Gin Lys Thr Thr 
1170 1175 1180 
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Gly Asp Lys Arg Thr Arg Ee Gin Gly Arg Val Ala His Leu Thr Gly 

1185 1190 1195 1200 

lie His Ala Val Glu Glu Val Ser Leu Glu Glu Phe Ser Ala His Pro 

1205 • 1210 1215 

Cys Ala Arg Asp Asn Gly Gly Cys Ser His He Cys He Ala Lys Gly 

1220 1225 1230 

Asp Gly Thr Pro Arg Cys Ser Cys Pro Val His Leu Val Leu Leu Gin 

1235 1240 1245 

Asn Leu Leu Thr Cys Gly Glu Pro Pro Thr Cys Ser Pro Asp Gin Phe 

1250 1255 1260 

Ala Cys Ala Thr Gly Glu He Asp Cys fle Pro Gly Ala Trp Arg Cys 
1265 1270 1275 1280 

Asp Gly Phe Pro Glu Cys Asp Asp Gin Ser Asp Glu Glu Gly Cys Pro 

1285 1290 1295 

Val Cys Ser Ala Ala Ghi Phe Pro Cys Ala Arg Gly Gin Cys Val Asp 

1300 1305 1310 

Leu Arg Leu Arg Cys Asp Gly Glu Ala Asp Cys Ghi Asp Arg Ser Asp 

1315 1320 1325 

Glu Val Asp Cys Asp Ala He Cys Leu Pro Asn GM Phe Arg Cys Ala 

1330 1335 1340 

Ser Gly Gin Cys Val Leu He Lys Gin Ghi Cys Asp Ser Phe Pro Asp 
1345 1350 1355 1360 

Cvs ne Asp GlY Ser Asp Glu Leu Met Cys Glu He Thr Lys Pro Pro 
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1365 1370 1375 

Ser Asp Asp Ser Pro Ala His Set Ser Ala lie Gly Pro Val He Gly 

1380 1385 1390 

ne He Leu Ser Leu Phe Val Met Gly Gly Val Tyr Phe Val Cys Gin 

1395 1400 1405 

Arg Val Val Cys Gin Arg Tyr Ala Gly Ala Asn Gly Pro Phe Pro His 

1410 1415 1420 

Glu Tyr Val Ser Gly Thr Pro His Val Pro Leu Asn Phe lie Ala Pro 
1425 1430 1435 1440 

Gly Gly Ser Gin His Gly Pro Phe Thr Gly He Ala Cys Gly Lys Ser 

1445 1450 1455 

Met Met Ser Ser Val Ser Leu Met Gly Gly Arg Gly Gly Val Pro Leu 

1460 1465 1470 

Tyr Asp Arg Asn His Val Thr Gly Ala Ser Ser Ser Ser Ser Ser Ser 

1475 1480 1485 

Thr Lys Ala Thr Leu Tyr Pro Pro He Leu Asn Pro Pro Pro Ser Pro 

1490 1495 1500 

Ala Thr Asp Pro Ser Leu Tyr Asn Met Asp Met Phe Tyr Ser Ser Asn 
1505 1510 1515 1520 

De Pro Ala Thr Ala Arg Pro Tyr Arg Pro Tyr De He Arg Gly Met 

1525 1530 1535 

Ala Pro Pro Thr Thr Pro Cys Ser Thr Asp Val Cys Asp Ser Asp Tyr 
1540 1545 1550 
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Ser Ala Set Arg Trp Lys Ala Ser Lys Tyr Tyr Leu Asp Leu Asn Ser 

1555 1560 1565 

Asp Ser Asp Pro Tyr Pro Pro Pro Pro Thr Pro His Ser Gin Tyr Leu 

1570 1575 1580 

Ser Ala Glu Asp Ser Cys Pro Pro Ser Pro Ala Thr Glu Arg Ser Tyr . 
1585 1590 1595 1600 

Phe His Leu Phe Pro Pro Pro Pro Ser Pro Cys Thr Asp Ser Ser 
1605 1610 1615 



<210> 5 
<211> 3096 
<212> DNA 
<213> HoEQO sapiens 

<400> 5 

catcttctca cacgatctct cgcttcgcac tccttccttt gattggtttt caccatttac 60 
tcagacgacg gtccttcttc gatctttgca cattcttcta tcatctacta ccttcatacc 120 
cagctccgtc ccctaatatt catgcgcgga tggcccattc cgtggtgaaa attcccttct 180 
actctgctaa tctgctgttc tctctccctc ccgtcgggtt ctgctcctgc cacgttctcc 240 
cctctcccca ccaaaggctg ggttttcttt gtcagggctc ctttcccctt tggaagaagg 300 
ggggctgtat ggccttggtg cgaggccctc cagtgacagg atcccccatc acccagagtt 360 
ccacaggccc tggtagggag gagggeeagc agaagaggag gtgccatctt tgcctgctgg 420 



wo 01/92891 

48 
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ggaagggcag gggccaccca cacagagctc tcccatttgc tgtggaccct ggggccactg 480 
cccagttcct tccaaaggaa agccagctcc ccaggtggtg ggagagtgat atggcttcct 540 
cttaaactta gggaattgag tgtgtggttg cttctaagtg ccttagaagc cgggagcggc 600 
tcctggaaag agcctgcctg ccacagcggg ccttaccctg gctgtgccca cagatgtccc 660 
tggggcctgc cgctcctgcc cggctctcct ggcctccccc ggtgtgggtt gggaaaagca 720 
cagcaaatta aaaaacacct ccatctctgg cctttgaaga atgcatctga acagccgaga 780 
gtgtaaaccg tggtgaaatg tggtctttcc agtttgggga gaagcagggc agagctgggg 840 
cttttgtacc cagggtttcc aagagctcct gcctccctcg gctgggctgg ccagggcccc 900 
ccgctgggac ctccagctgt aatagggaag gttttactgg gttgctggcc actgtggact 960 
gcccctaagg gcaggtatgc ctgcctttac ccgggttccc ctcctgcctg gaagatacag 1020 
cccatgggag gcctgttgtc tgtgggatcc tccagcatca gagacactgg ggccagcgtc 1 080 
tgcctggtga ggtgcaggcc tggcaggccc ggtcccccac ctgcttgagc acccacggtg 1 140 
gtgggggctc gctgcctccc gagacaatct atgtcattgt tgtccaagga agctaattta 1200 
gagtagaaag ttccgtgtcc agtcccacto tgtgcgtgtg ttagcagggg actctcgggc 1260 
cggagctggg tccaccctgg tagggggact tcatggggcc tgggcgacag cactgtgtat 1320 
ttgtgtgtgt gtgtgtttgt gtgtgtgtgt gtctgaggag gtggaccagt ttctcaaaag 1380 
gcctgtgacc ccaagaacca aggaatttca gcctgggtgg atcacacctt cactggtgag 1440 
tgggacaagc tgggggccct cgccacagga gcagccaggg catggggcac agttggcctc 1500 
attcacaaaa tgggagtata agtgatccct gctetggcgg ccaggacgat gagtgggaac 1560 
acaccgtgtg ggggctgcct ggcctgggtg tgccgcgggt gtccttgttg gtgatggtto 1620 
cacctgcttg tgccaccag^ gcxctctggg tctcacacac aactctcttc ccagcgaagg 1680 
cccctcctgc cctcaggcct cagtgctgct tccgtctcgg aaggccccag gagctcctgc 1740 
atcctgggcg tgattcctgt gtgcctgcag accccctogc ggctgccatc tcatcctttg 1800 
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gtgcacctgt tggccagacc tcctggtagc gggtgctgca ctcccctgaa tgtgccgggg 1860 
cctgggggca gggacctggg ctcctccctc actgagtgga gggaactcag tgtcttggag 1920 
ttggggtgcc tgcaggctgg gtggtgcagg tgaaatgcag acctctcagc tggtgttcca 1980 
gagcagctgc cttcccccgc ccgagggact tcacccgcag cccagtcagg ggtggcgcct 2040 
gggtgcatcg cccgcaggct gggtaggggt ggagcctggg tggccctgcc tgtgagctgc 2100 
atagttgtcg cctttgaccc tgagttttct tcgttatctg tttggacctg tttggggcag 2160 
gcaggggatg agatctgaag ataaatgcct tagctgtgac catctccttt tgtgagaggt 2220 
caatgtccag ttccgctgca gttataacat cccatltttt gatttctttt tattttttcc 2280 
tttttctttt tgagatggag tctcgctctg tcacccaggc tggagtgcaa tggggtgacc 2340 
tcagctcact gcaacctcca cttctcgggt tcaagtgatt ctcctgcctc agcctcctga 2400 
ctagcagggg ttacaggcgt gagccaccac gcccagctaa tttttgtatt tttagtagag 2460 
gcaaggtttc gtcatgttgg ccaggctggt ctcaaactcc tggccttaag tgatctgccc 2520 
gcctcggcct cccaaagtgc tgagatgaca ggtgtgagcc accgtgcccg gcccagaact 2580 
ctttaattcc cacctgaaac ttgccgcctt aagcaggtcc ccagtctccc tcccctagtc 2640 
cctggtccca ccattctgct ttctgtctca atgaatttgc ctaccgtaag tacctcatat 2700 
aaattgaatc ataaagtatt tgtcttttta tatctggctt atttcactta gcataacatt 2760 
cttaagtttc atccatgttg tagcatgtgt cagaatctct ctcttttttt tttttttttt 2820 
tttttttttt ttttgcagac agagtacgc tctgtcatct agactggagt tcagtggcac 2880 
gatctcggtt cactgcaaca tctgcctcct gggtccaagc aattctcctg cctcagcctc 2940 
cttagcagct ggaactacag gcgcgtgcca ccatgccttg ctaatttttg tatttttatg 3000 
tggaggcagg gtttcaccat cttggccagg ctggtctcga attcctggtc ttcaccacgg 3060 
gggcccgaag gacccgggca aagcgtggag gggagg 3096 
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<210> 6 
<211> 26928 
<212> DNA 
<213> Homo sapiens 



<220> 
<221> unsure 

<222> (12044),(12489),(26433),(26434).(26435),(26436).(26439).(26441) 
<223> Identity of nucleotide sequences at the above locations are unJmown. 

<400 > 6 

gaagaccaag ggcacacagc gaggcagttt cagggcgggc agcdtggggc cccacggggc 60 
ggccccggac acttgttctc acctgtggag ggcagagaag ggaacaggga gagaagtggc 120 
cggctgggag tggaggtggg tttgaggttt tactgtaaac taaatgtgta ccctctacct 180 
tagttatgaa ttatgagaca cgaagactgc gaaacagaca cactcctcta aaagtgcctc 240 
taggctgaca gggagaaagt cccgccaggc tcccagacgc cacctttgag tccttcaaca 300 
agcccgccag ggcctcttgc ccaccggtgt cagctcagcc actgaaccct ccaggaagaa 360 
gacgtgctgg taggagaaga atctcaccca ggcacagcct ggaaggggca cagaaggggc 420 
tccggaacca gcaagcccaa gttggaactc ccagtctgct actttctaga acgactgtgc 480 
ccttggcggg tctaagtaga acctctccgc gcactctttc ctcctttgta aagtggggac 540 

agcaatggccaccttgcaggttcagagagggcttgcagtacctcacagaactgagtgccc 600 
gtgaacgtgt gtgttcctcc agatttgtga cagctttgcc aggctggagt caggctgaac 660 
Ecctctgccc tcatggggtt tatattctag gaagaccaac aaaaacaaga agacggaaaa 720 
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ttaaaacaac aaaagcccca ttgacaggcc gtgaagaatg ccatgaaaaa tgaatggcgt 780 
tgtgctgcag tctttgggga aacgggctta cggaaagaag gacacttgag ctgctaccaa 840 
tgagcagccg tccggtggga gggcagttca ggaagagcag acatccactg aggaggcgct 900 
ggggcagagg gcagcctggt cgctggattc gggggaggaa ccacatcagg ccatgagctg 960 
gagctggtgg tagaatgtac aggagaggcc agccagggcc agctcatgtc agacctcaag 1020 
cggggaagat gaatcgagaa tgcaccccac gagcaatggg aagccagtct acgatttaag 1080 
cagcaaaaat attttccctt cttccaccct gcatccagct ctaccagcac agcctggggt 1 140 
tctattttca agatagaata gacccagact cccagctctt cttacacttc tactactgcc 1200 
acctgtcacc cactcatgcg tccccacttg cagcctcgac ccccttccac ctgatctcat 1260 
ggcagccagg gaagctccag ggctcgtgag ggctgccatc tcaggaaaga agcaaaagcc 1320 
ttcggcacct gcagggcctg ctccaaccac acttcttcct tgacctctca gcttccttag 1380 
ccactccctt cccacatctc accctgctcc agccacagtg gtgtctctgt gggttctcaa 1440 
acacaccagg tgcactcctg cctcagggcc tttgtgcttg ctgttctctg ctgggactct 1500 
tttttttttt tttttttttg agacagggtc tcactctgtg gcccaggctg gagtgtagtg 1560 
gtgtgatcgt agctcattgc aacctcaaac tcctgggctc aagcaatcct cccacctcag 1620 
cctctcaagt agttagcttt tgttgttttg ttttgagatg ggatctcact ctgttgccca 1680 
ggctggagtg cagtggggca atcttggctc accacaacct ctgcctccca ggctcaagca 1740 
attctcctgc ctcagcctcc caagtagctg ggattacagg catgtgccac cacgcccagc 1800 
ttatttttgt atttttagta gagacagggt ttcaccatgt tggtctggct ggtcttgaac 1860 
tcctggcctc agatgatcca cctgcctcgg cctcccaaag tgctgggatg acaggcatga 1920 
gcctgtctct agtagttagg actacagaga ggggccatca tgcctggtga tcctcccacc 1980 
ttttctgctc caactctttc accccactta gcctcgtggc tcactctctt acctcttcag 2040 
ctcctcagtc aggcctgagg acccctgttg aaaattgcaa accacacccc ccaccaccac 2100 
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cacccactat tgccagcact ttctactcca tttctctgct ttacttttct cctttgtact 2160 
catcaccacc tgactcatta catgtttacg tatctttctt ctctccacta gcatggaagc 2220 
tccaggagag cagagagtgt agttttattc cctgatgtgt ttcctgtgcc cgtaccaggg 2280 
cctagcacac agtaggtgct cagtaaatgt gtgttggatg aacaaataca gtgaaaggat 2340 
ctgatctaca tttataaaga aggcactctg gctgctgagt ggggatgaga ctgtcaggag 2400 
gaaagaggcc cctgtggggg cctggccagc aggtgggtac aatggtagca gccaggagag 2460 
agggcctctt ggactcaagt ggatggggcc tgctcagggc tecggccaca ggaacaaagg 2520 
gaagggggcx; caggatggcc tgtcatagag gacacattac aactggccca aagttcaagt 2580 
caggtttcta aatttgggaa gggatacaga aaaactaaag actctactgg acagtcagtt 2640 
attgaaatga ttacatagaa aatgtaccaa gaattaaaaa aaa^aa^^f^f^ aagcattatg 2700 
aaggggccac cagagactcc cagagaggaa agggactatg ggctggatgc ggtgactcac 2760 
acctataatc ccagcacttt gggaggccga ggagggtgga tcacgaggtc aggagttcaa 2820 
aaccagccta ggcaacatgg taaaaccccc gtttctacta aaaatacaaa aaattagctg 2880 
ggcatggcag catgtgcctg taatcccagc tactcgggag gctgaggcag gagagttgct 2940 
agaacccagg aggcagaggt tgcagtgagc cgagattgag ccactatgct ccagcttggg 3000 
cgacagagca agactccgtc tctaaaaaaa agaaaaaaaa ggccagatga ggtggctcat 3060 
gcctgtaatc ccagcacttt gggaggccga ggtgggtgga tcacgaggtc aggagatcga 3120 
gaccatcctg gctaacatgg tgaaactcca tctctactta aaatacaaaa aattagccgg 3 1 80 
gcgtggtggc gggcacctgt agtcccagct acttgggagg ctgaggcagg agaatggcgt 3240 
gaacctggga ggcggagctt gcagtgagcc gagattgcgc cactgcactc catccagcct 3300 
gggcgacaga gttagactcc gtctcaaaaa aaaaaaaaaa aaaaaaatta gctgattagt 3360 
tgggcttggt ggcgggcgcc tgtaatccca actactcggg aggctgaggc gggagaatca 3420 
cttgaacccg ggaggcagag gttgcaatga gccgatatca cgccactaca ctccagcctg 3480 
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ggcgacagag caagactcca tctcaaaaaa gaaaaaaaaa aagaaagggg ctgtgctgtg 3540 
gcctgggacc caaagcacac tactgcaagg tcccagggtg cctgactcca accgga:gcct 3600 
tgagaacatt catttgcaaa gaatgaatta aaattcagca ctattttatt ctgcaggatt 3660 
ccagcacccc aaggacagtc atttttagac ccttcagtaa cgtaataagt aaccggagga 3720 
tgtgctgagc ttccacttcc ccagacggtt gcctgtcaca gctcatcagg ccaacaaact 3780 
tttcttaggc ctcaaatttg gaaatgttca ctctcagttc gttccttaga tgcaagtcca 3840 
tcccaatgaa gtaacagggg ctcagcacct gtccaatctc attgcttccg gggacagggg 3900 
cccatgagga tgtcgtttca gcccggtgac acttgggcaa agtgcctttt ggtttccctc 3960 
ccaggctgga acgtgctggc tctgtgaagt tacgctgggc acaagagccc cccccaaccc 4020 
ggcaggactgactgctgtggtcagaggcgcccctggggctttgggagccacagaatcttc 4080 
ctgagggcag cgccggagga ggccccagtg agagtgccca ctgccaggct cattcctcag 4140 
gctgccgcag gcctctcccc aaaacaggca atgcttctca gcaacctgcc ccaggagcag 4200 
gccagggaag gccgccatcg gcctacagtg ctgggctctg gagggcttgg ttggtaacag 4260 
gccatggttt ctatgagcca gctggggtgt gaaggacaca ggctggattc acctctctgg 4320 
gcctcagttt ctgcattcaa aaagtgggaa tcatgatatc tgctctattt cttatctctc 4380 
agtgctgatg tgaacctcca ataagacttt taaaaatact ctttctacct tacttttatt 4440 
tttcatttat tttaagataa tgtctagctg tctcacccag gctggagtgc agtggtgtga 4500 
ttacggctca ctacagcctt aacctcccag gctcaagtga tcctcctacc acagcctccc 4560 
aagtagctgg aactacaggc atgcaccacc gcacctggat aattttttct tttgagacaa 4620 
ggtttcactc tgttgcccag gctggagtgc agtggtgcac tcttggctca ctgcagcctc 4680 
aacctccctg ggcttaggtg atcctcacac ttcagtctcc caagtagctg ggactacagg 4740 
tatgtgccag tacacccagc taatattttt gaaggatggg gtttcactat attgcccagg 4800 
ctefftcttsa actccagget ttaagcaatc taccttcctc agcctgccaa agtgctagea 4860 
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ttataggtat gagccacccc ccggcctata atcctaccac tttaaaaaag cctgtaattt 4920 
tagcacttta aaaaattttt ctaaattttt tatagagatg ggggacagct gtggtctcac 4980 
tgtgttgccc aggctggtct tgaactccta ggatcaagcc atcctcctgg cctggcctcc 5040 
caaagtgttg ggattataag cataagcctt accttacctt ttttttttga gttgcagttt 5 100 
tgttcttgtt gctcaggctg gagtgcaatg gcaagatctt ggctcactgc aacctccacc 5160 
tcccgggttc aagcaattct cctgcctcag cctcccgagt agctgggatt acaggcatgc 5220 
gccaccacac ecagctaatt ttgtattttt agtagagatg gggtttctct atatacctta 5280 
attttaaagc actgcattca tgtaaattgt gattaacatg gattcaagag agggagtgag 5340 
gatgaatgag ccaggcagtc acctcggctg tcaccctcca cttctctcct ccttctgaca 5400 
gtcatcgtcc atccgtttct gcagctgttt gtttgactct cctgatcatt ttgcttgcca 5460 
cataacttgc ctcctgggaa agaatgccct gggcaggccc acatgagtag tgaaaaataa 5520 
tctgcagtga aaaataaaac taagtagtct ggtccacaga gcagtcttat tttttcactg 5580 
cagatgaagg agttgacatt caggcttcat tctcatttat aagtgtttta aagacacata 5640 
cagtggattg aacagtggcc ttcaaaaaga tgtatctaca tcctaatccc tgggacctgt 5700 
gaatgttaaccaagttaggaaaagggtcttcxcgggtgtcattaagttagagatcttgag 5760 
atgaggagctcatcg^gattatccaggtggaccctgcatccaaggacaaatggtcctta 5820 
gaaaagaaaa gcagaggctg ggcacagtgg ctcaagcctg taatcccagc actttgagag 5880 
gccgaggtgg gtggatcacc taaggtcatg agttcgagag cagcctggcc aacatgatga 5940 
aatcccatct ctactaaaaa tacaaaaatt agcaaggcat ggtggcgggt gcctataatc 6000 

ccagctactcaggaagctgaggcaggagaatggcttgcacctgggaggcggaggttgcag 6060 
tgagccaaga tcgcgccact gcactccagc ctgagggaga aaagtgaaac tctgtctcat 6120 
aaaagaaaag aaaagcagac agagatctga gacagaagag gagagtgaag gaaaaaaggc 6180 
catgtgaaga tgaggcagag gttggagcca tgcagccaca agccaaggaa tacctggagc 6240 
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cccagaagtt gcaagaggta ggaagaagcc tcccctagag cctccagacg gagcacagcc 6300 
ctgccaacac ctccacctca gacttctggc ctccagcact gtgagataat caactgctgt 6360 
• tgttttaagc caccagattt gtggtaattt gttatggcag ccacaggaaa ctaatacagt 6420 
acctaatctt cacaaaccca tcttacagaa aaggaaactg aagtcagaga ggtagtggct 6480 
tgtgcagtgt gttaggccat tcttgtatta ctataaagaa atacctgagg ccgggcatgg 6540 
tggctcacgc ctgtaatccc agcactttgg gaggccaagg tgagtggatc acttgaggtc 6600 
aggagttcaa gaccagcctg gacaacatgg tgaaacccca tttctactga aaatatgaaa 6660 
attagccagg catgg^gcg tgcatctgta gtcccagcta ctcaggaggc tgaggcagga 6720 
gaatcacttg cgcccgggag gaggaggttg tagtgagcca agattgtgcc actgcactcc 6780 
agcctgggag acaagagaga aaccctgtct caaaataaat aaaaaacaaa taaacacctg 6840 
agactgggta gtttataaag aaaggggtta actggctccc ggttctgcag gctgtacaag 6900 
catggtgccg gcatctgctt ggttgctggg aaggcttcag ggagttttac tcatcgtgga 6960 
aggcagagcc agagcaggtg catcacacag caaaagcagg agcgagagag agagagagca 7020 
gggaggtgtgcacacttttaaatgagcagatctcacgagaactcaccattgcaaggacag 7080 
caccaagcca cgaggggtct gcccccatga cccaaacctc ccactaggcc ccacccccaa 7140 
cattgggaat tacagttcaa catgaggttt ggggggacaa atatccaaac tatatcattc 7200 
cacccctggc cccccagatc tcatgttctt ctcacattgc aaaatatagt catgccttcc 7260 
cagtagcccc ccaaagtctt aactcatccc agcattaact caaaaatccc attcccaagt 7320 
ccaacgtctc atctgaagat gagttccttt cacctacaag actgtaaaaa tgaaaacagt 7380 
tatttactgc tgagatacaa tgggggcata ggcattaggt aaacattcct gttccaaaag 7440 
ggagaaatcg gtcaaaagaa aggggctata ggccccaagc aagtccaaaa cccagcagag 7500 
caatcattca atcttaaagc tccaaaataa cctccttaaa ctccatgtcc catagccagg 7560 
scacactset ecaaeeaeca ggctcccaag gccttgeaca gctctattcc tgcggctttg 7620 
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cagaattcag tccccatggc tgctcttaca gattggagat gagggcctgc ggcttttcca 7680 
ggtgcagggt gcaagctgct ggtgatctac cattctgggg tgtggatggt ggcggccccg 7740 
tcccgcagct ccactaggca ttgtcccagt ggggactcta tgtggggcct ccaaccccac 7800 
atttcccctc caatgggaag gctctgcccc tgcagcagcc ttcttcctgg gctcccaggc 7860 
tttctcatac atcctctgac atctaggtgg atggtgtcaa gcttccttca ctcttgcact 7920 
ctgcacacct acaggcttaa caccacatgg aagctgccaa ggtgtatggc tggaaccctc 7980 
tgaagcagca gcctgagctg tgactatggc cctttgagcc aaggctggag ctggaacagt 8040 
ctagatgcag gcagggagca gtgtcctgag gctgtgcaga gcagcagggc cctgtgcctg 8100 
gacaatgaaa ccattctttc ctcctcatcc tctgggcctg tgatgggagg gttgtggaag 8160 
atctctgaaa tgcctttgag gcctttttgc ctctgaggcc tatttcctat tgtctcagtt 8220 
attggcagtc ggctcctttt tagttatgca aatcctctag caagaggtta ctccactgcc 8280 
ggcttgaact cctctcctga aaaagctttt tctttctttg tcacatggcc aggctgcaaa 8340 
ttttccaaac ttttatgctc tgttttacct ttaaatataa cttctaactt taattcattt 8400 
atttgctcct gcatttgagc atagggaatt caaagaagct gggccacatc ttgaatgctt 8460 
tgctgcttca aaatttatgg ccacgcttgg tggctcacac ctgtaatccc agcactttgg 8520 
gaggcctagg tgggcagatc acgagatcag gagatcgaga ccatcctggt caacatggtg 8580 
aaacccatct ctactaaaaa tacaaaaaaa ttagcttggt gtggtggcgc agacctgtag 8640 
tcccagctac tggagaggct gaggcaggag aattacttga acctgggagg cagaggttgc 8700 
agtgagccca gatcatgcca ctgcactcca gcctggtgac agaataagat ttgatctcga 8760 
aaggaaggaa ggaaggagga agggaagaaa tgtcttcccc ccagatgtcc tgggtcatcc 8820 
ctcttatgtt caaacttcaa cagatcccta gggcatgaaa ataatacagc caaattattt 8880 
gctaaggcat aacgaaagtg acctttgctc cagttcccaa taagttcctc atttccatct 8940 
eagactcatc accctggcct tggcttstcc atatcactgt cagcattttg gtcacaatca 9000 
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„c «^ggga. gc^aggcaa gaggatca.. tgaacccagg agg«gaggc 9060 
tgcagtgagc .g.gat.aca tcacgcagt ccagcttggg caacagagpa aga«ctg.c 9120 
.caa«>aa«aataaama«a.aaataact.aagmat..aaagc.gca.cmg= 9180 
eacca.ggagaaaggccaggccag«»:«c.«cmctgcacg««co«caoco 9240 
agc.gcc.«gc„gaggaacagagggag.aggaa^gcca.cccaggaggccc 9300 

agcacccca. gaccggcc .ggggoc«g tgggmatg ga«cccagt gctgagtcat 9360 
ccc.cacaggc.«tgtgggcacct.ggacat.gg.cagaagoa.g.gg.ccccgggaao 9420 

acacc«cc«atca.c.gggaagggcagcng.gccagcgaggocacctgttcagcg 9480 

. ccacggcccg ccagacagc gcagccacag cCgccm gatcagagca aacaccagac 9540 
a.g.gtg^.gccoccaacccaK.ccaggggacacatgtcotttcttgccaggcc.ga 9600 
ga^aacaagagagggacaagtccccaagcc^ctc^cttcctgcctcacccaotccg 9660 
c.g»ganc^gg«ga.gg«gg«aac.agggcaaccgacca.c«ggMa^ 9720 

^^gaaagag ggggcam. caggaataaa acgcaaaag tcggagcaa acaggagcaa 9780 
g„ggtcactCgggg«ggtggag«-gg™-8gccccc.-cgcaagc 9840 
■ a.ggg«gaacccaggacaggaacacagagcaggccccaggacogggcttg.cact.aca 9900 

agtcmmttttttttttttttgagaJggagtatgctctg.ca.cagggaggag«a 9960 
,^ocatcttagctcactgcaacc.c,gccttc«ggttcaag«a.ccccc,gc 10020 
e,cagcc«c.gag«gctgggac.acaggtggcaccaocacgcccag«aa«mgt 10080 

amcugu gagatgaga. ggccaggc^ gtcttgaaot cctgacotca ag.ga«:.gc 10140 
^gccttggcc.cccaaag.gctggga«acaggtg.gagcca«gtgcc.ggc^cact 10200 

,3caag«t aaaccalgcc .cagcaca.c aa«ccam acaaaaagg. agagggattt 10260 
^caaaaa«.ga.gaaagaca.agga«aaga.ca.g.cag«taaacataggt 10320 
e.«t.ctat.aa»aatt.a..ga.«a.««««-c™tccca.cactt 10380 
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tgggaggccg aggcgggcgg atcacgaggt caggagatcg agaccatcct ggctaacacg 10440 

gtgaaaccccatctctactaaaaatacaaaaaatggccgcgcgcggtgactcacgcctgt 10500 
aatcccagca ctttgggagg ccaaggcggg cggatcacga ggtcaggaga tcgagaccat 10560 

cctggctaacacagtgaagccccgtctctactaaaaaatacaaaaaaaattagccaggca 10620 
tggtggcggg cgcctgtagt cccagcaact tgggaggctg aggcaggaga agaatggtgt 10680 
gaacctggga ggtggagctt ccagtgagcc gagatcacac cactgcactc cagcctgggc 10740 
gacagagtga aactccatct caaaaaaaaa ataaataaat aaataagaat tgttagtatt 10800 
ttgcaggtgtgacaaatgattctgtttctgtggcagaatgttctcaggagatctcttttg 10860 
aactctcatggaaagcatcatgctgttggcaacatcacatttatttttatttatttatta 10920 
ttttttagag acagggtctt gctctgttgc ccaggctgga gtgcagtggc acaatcacag 10980 

ctc^ctgcagcctcaacctcctgggctcaagcaatcctcctgcctcagcctccc^ 11040 

grtgggacca caggcgtgag ccactgcact cagcccaatg taccttcaat atttacattt lllOO 
ctggcaaagg tagcaaaacc ttaacaaatt ttgaatctag ataataaaat tatgaggctg 11160 

ggtgcagtggccctgacagggatggctcacatctgtaatctcaacattttgggaggccaa 11220 
ggtaggcgga tcacctgagg ccaggagttt gagaccagcc tggccaacat ggtgtaaccc 11280 

tgtctctaacaaaaatacaaaaaaattagccagacgtggtggtgcacgtctgtcatccca 11340 
gctactaggg aggctgaggc aggagaattg cttgaacccg agaggcagag gttgtgatga 11400 
gccgagatcg cgtcattgca ctccagcctg ggcaaaagca agagcgaaac totctctcca 11460 
aaaaataaaaaaaaaataaattaatgaattaattaaaataaaataaaataatggatagtc 11520 
actgtaaagaaaaaataaatgtatatatcagccaacaagtgatggaatagagcaccc^^ 11580 
ctccctggctggacagatacatcccacaacacctggaaggcggctccatgtagaactttc 11640 
tggactgcttgaggtgctgtgctggagcacggtgacagaggagctggaccatggacxrtcc 11700 
ccceeccccc accaaeescE a^^tccccct ets^HEStc teagsgaggc atccmtgg 1176O 
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cctctgcggc ttgggcaggg aatttggggt ccaagtactt ggtgcaaagc ctggaaagag 1 1820 
ggtttgggtg ctgagggcat atcccctggg ccacatgggg gcagaagtgg ggcccxctga 1 188Q 
agcttggagt cctgggcagg ggcatctatt ttgctgtctg aggccttcag tacttgaagc 11940 
aaaatggagg cagaatgtcc caccttaatg cccctgattc ctccaaacca attccagaga 12000 
cagcaagggccagaacagggatggccctgcccagggtcatgcancgaggaagtggccagg 12060 

ctgggatctg aacccaggct aatcccctcc cttgtcctcc tccaggccct cacccctgca 12120 
tagagccctccagctcactcatcctcggccagctccatctcctcagcttgtaaacccccc 12180 

cgggattttc ctttcttaaa aaacaaaggc ttggccaggc acggtggctc acgcctgtac 12240 
tttgggggtg gctcccagca ctttgggagg ccaaggtggg cggatcatga ggtcaagaga 12300 
ttgagaccattctggccagcatggtgaaaccctgtatttactaaaaaaaaaaaaattaac 12360 
tgggcatggt ggctagctac ttaggaggct gaggcaggag aatcgcttga acctgggaga 12420 
aagaggttgc agtgagccaa gatcgcgcca ctccacttta acctggcaac agaacaagat 12480 
tccgtttcna aaaacaaaca aacaaacaaa taaacaaaaa aaggcggagc gcgatggctc 12540 
gcgcctgcaa tcccagcact ttgggaggct gaggcgggcg gatcacttga ggttaggagt 12600 
ttgagaccag cttggccaac atggtgaaac cccatttcca ctaaaagtac aaaaatcagc 12660 
caggtgtggt ggtgggtgcc tgtaatccca gctactcagg aggctgaggc aggagaatcg 12720 
cttgaaccca tgacctggag gctacagtga gctgagattg cgccactgta ctccagcttg 12780 
ggcaacaaga tttgtttctc taaaaaaaaa aaaaaaaaga ctggcccttc cccttcagct 12840 
cttcctcagg gtccctgagc actctacacc cccgtctaca ctgagcactc caccctgctg 12900 
tctacactga gcactccacc ctgccatcta cactgaggac tccaccccac tgtctacact 12960 
ggctgcctcc cgccctcacc tcctgctaag gccattcccc gctgcatctg tcttctagat 13020 
tctgcagcct tcagcacgct gggcccctcc tttgtcccct tgagccacct ccagcctccc 13080 
cctgagctgc tactcctctc ccagcagcct ccacccaagc ccctccagtc cccaagctgt 13 140 
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cccttgcatc cagcactgcc cttccacgtg ccccttccct ccagcttcac agcagggtgg 13200 
ggcctccagg ccctgcccac tgtgcccatc cacaagttgt ggtgggagct ccgaggggag 13260 
gcaggggtgt gcatggactt gggacgtcca agtctgggac caggggcagc tggttggtgg 13320 
agtgtggagg gggataggga ctttcaggta gagaggctgt aggggcaaga tcgggacggc 13380 
ggatgtccct aaggagggct ctgacctggg aaatattgtg cagcttcctc tttgccattc 13440 
ctggagctca gacactggcc ggctctcacc ccgcccttcc tgcaggacac agctccatcc 13500 
cagtgagttc ctagtgtaga catctccagc agcacggatg ggaaaggaag tcatcaaagg 13560 
tgcccaggac cggaggcttt ttctggaggt ggcagaggag ggtgtgggtc tcagggctct 13620 
ggctgagggc aagcgtggga ggtcttaggt ctgcaccagc cccgtgaagg cccctcctgc 13680 
tccctggtgg agtcctagag ggaacagcag cccctaggct ctagcaggag tgggtagggg 13740 
cttttctggc ttcctactgt gccagcagga tagctgggcc tggcactgag cccaaagatc 13800 
acatgccggg gcattggcgc agtgaggaac agacccttgc caaagctggc aaagaagacc 13860 
ccatggggtg cagctggtga agctgagagc tcaatgtttg ggggagcctg gcaaaagggg 13920 
tcctcccctc cctctgcagg ccaggatcgc aggttttccc tacatgttgg taattctcaa 13980 
acaatcccat ggccactgga gcaaagatca cagtgggcgg cggcetcggg agcagtggac 14040 . 
agggcacgca gtgcctttga tgccagagcc ctcgccccaa agtcaacaaa ctctgcagcg 14100 
gactttgcac ccggactttg ttttcaccat acaaggaaag ggacagatca caggccctot 14160 
cgctgccctc gctgagccgg aagctgcagc gtgagctctc tcaagcccca tttctaggtt 14220 
ccccaggcgc acccctgagc ccctactcgc ctattaagtt ctcctaatag cccttcaagg 14280 
tcttaatgta tgtccattag acagagggga aaactgaggc gagggcaagt gacttgaccg 14340 
aggttcctcg gcgagcaggg cgtggagctg agaacctcgt tattactgct ccccacacaa 14400 
ccctctggcc gttcttggaa gaaggctgag ccccgggggg gccagagtga cccaaacacc 14460 
atgggccgcc tgcggtaaca cgtgcggcca cgaaggggca gcagtttccc gcccggccgg 14520 
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gctctctccg gcgctcagta tccgtcccag gccaagaaga agaaactcgg ggaggagggc 14580 
ggagggggct gcgtgggagg gcgtggaaga tggacgtggc caggggagtg gcagctgcac 14640 
acagtggatg ctgttaagat gaagggaaag aacgtgggct ccgagatcac tggacacggt 14700 
tccacctttc ttcccgctca ctgcatggcc ctgggcgggt tgttgaaccc ttggaaacct 14760 
gtttttcctt ttttcctttt tttttgagac agggtcttgc tctgtggccc agactggagt 14820 
gccgtggcac gatcttggct cactgctgcc tcccaggttc aagtgatcct cccagctcag 14880 
cctcctgcgt agctgggacc ccaggtatgt gtcaccacag ccggctaatt tttgtatttt 14940 
tttgtagaga cgggatttcg ccgtattgcc caggctggtc tcaaactcct gagttcaccg 15000 
gatcttcctg cctcagcctc ccaaagtgct gggattactg gcatgagcca ccgcacccag 15060 
cagagacctc agttttctaa cctgtgccag caggaataat gatagctgcc tagcttggct 15 120 
gtgctgggaa ttaagtaaga tgaccgggta gcaaatatga agtattactg gacacagagg 15 180 
gccccaggct gggttagcag cggtggtcag ggctgctgct tcctggcctg agctcgaagg 15240 
agggccctca ttaccacctg ggtgagtcct cgtccaagcc tggcactgct gcgtgggaat 15300 
aacttctgcc acccaagttg gcagattgtg tgcaaagtta agtcctgact ctgtggggtg 15360 • 
gacttcgagg cctcttcatc ggacctgctt ccggtgactg cattcgcacc tcctcctgtt 15420 
cctggtttaa cacagcccag ctttcctcct gctgagccct ccctgggcct gctgtcaccc 15480 
tcgtgccgct gtgcctcgca gtgccactcc ctgtaccctg aatactttgc cctgcctctc 15540 
cacccagctg agagtcaggg cccctgtgag gctctgccca gcccgtcctc cgggtttctg 15600 
cctctgctga gcacttccct gcatgattgc ttctgagagt ccccccagcc tgtgagcttc 15660 
tcaggactgg gacagcttct caggaccgag gcttcctggt ctgcttgcaa ttttacaggc 15720 
gggcacattt tcccttggcc aacatcagag actggacatc tgcagatctg tgctagccac 15780 
tgagcaccca ggcaccccag caggtagctc tgtaaccaac ccattctgta aagctgaggc 15840 
tcasagagfft eaascgcctg gcctggggcc acagcctgcg tcagctgcag agccasgagc 15900 
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tgagatatgc acctgcggct ctgctcacag ggtcctgcac agactgctgc tggagccacc 15960 
tatgtagagt caagagagtt catgttaact ccctctcaca tccctcagcc agggtggggg 16020 
ctgacgatag acactcaggg atggcctacc ctccccaaca acccccgtca ggtttgccgg 16080 
atctccttgg aagaaaagtt ctgggcagaa ttccaccgtt ggcctggcct acactctcct 16140 
tagtggctta ggaccctcag cggtggataa gttgtgggca gaagagatgc aatcaggatt 16200 
ctcacccact caccccttgc cagccccaat aagctcaata agctgggctc ggtctgagga 16260 
agtgtccagg aaatgtgcaa atggcctggg acagccctgt gttcctttoa gtaaggttgc 16320 
tgaaggtgag gctgaaagtt ggagaaacag aagccagtgc ttatggtttt aattaagata 16380 
atggaatgta tgtatgtatg tatgtatgta tgtatgtatt tatgtattta tctttagaga 16440 
tagagtctca ctctgttgcc caggctggaa tgcggtgaca caatcatagc tccttgcagc 16500 
ctcgacttcc tatgcccaaa tgatcctcct acctcagcct cctgagtagc tgggactaca 16560 
gacacacgcc aactatgcct agctaatttt tatttctatt ttttgtggag actgggttct 16620 
cactttgttg cccaggctgg tcttgaaccc ctagcttcaa gcaatcctcc tgcctcagcc 16680 
tcccaaagtg gagggattac aggtgtgagc caccacacct ggcctggaat ttatttgtat 16740 
tctgcttata aaattaatac attcttattg cagaaaagtt tgaaaataaa agaaaggaca 16800 
aagaacaaaa agcgtatata atttcacagc tcagatctca ctgctattaa catttttatt 16860 
tactttcagg cttttttctt tctaggtaca tatgcagaga ttattttatt ttatttattt 16920 
tattttatat tttattttat attttttatt tcattatttt attttatttt attttattat 16980 
ttttagagac agggcctcac tctgtcaccc aggctggagt acaatggagt gatcatagct 17040 
cactgcagcc tcaaacacct gggctcaagc aatcccccca ctcagccttc tgagtagttg 17100 
ggactaaagt gtgagtctgg ctaatttttt ttactttttg tattgacaga ggtctcacta 17160 
tgttgcccag gctgatctca aactcctggg ttcaagcgat cctcccacct tggactccca 17220 
aastsctges attacaggca tsagccacca tgcctggcct aaaateccac tttttetcat 17280 
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ttacUaaa.ccoatggacactttgacatgWg.Bnctatgc.attga.cSac.gt. 17340 
«ca.c.aca.ca..atggcca.c.a.ca.c.a.cataa.cca«uacattaaaat.g 17400 
tgctgctgcttaga..t..c.ggcctg.ctccut«g.attct.ccaga.aaa«.tag 17460 
aa.cat«a.caaat.cccct«cagaaaaagccc.a..ggat«gg..gaaaaatac 17520 
-tgaattmacatuicttig^gctgggcacgg.ggc«acgcc«a,.ccc« 17580 

cacut^ga ggccaaggca ggtggatcac uga^ gag^E^ga ccagcCggc 17640 
eaacatggtg aaaCcggU: n^Caaaa atacaaaaa. tgccaggcgc attggccac 17700 
ctgmtccc agcacmgg gaggccgagg .gggtO- acgaggtcag gagatagaga 17760 
eca.cctggc,aacacgg.gcaaccccg.ctc.cc.aaaa atacaaaaa ttaECcaggc 17820 

g.gg.gg.gg gcgcctg.SS --SO- ^"^"^ 
acccaggagg cggagcttgc agtgagccaa ga.cgcgcca ctgcactcca gcctgggcga 17940 
cagagtgagac«:ca.c.caaaaaaaaa.aa,aa.aa.aatacaaaaa..agccggggg. 18000 
c^cgtgc accmaatc ccag«aca gggaggcg. ggcaggagaa .cgcttgaat 18060 
ccaggaggtg gaggngcaa .^cagaga tcgtgccao. g^ctccagc ctggg.gaca 18120 
gagtsacactcg^aaaaaaaaaaaaaaattctgaaggattgagaccttagac^oa 18180 
gg,c..cc^tccaagagcacaa,a.agct,..ca,g.at.caagcc.ttncaa«cat 18240 
,,,eagaat.«acagm.tt.catgamtcc.gc.att.a.a<aaaatgtattcc.a 18300 

gatatWgc atgmtccg gttg«gtt 

,aa.tggctgt,amgma.atgacatctgngaam«gatuc.tsaaaa,gg 1S420 
,^gtgtt.«..«aact.»attttgagataatt..ga«ucagaagat 18480 
,tgcaaaaatagucagagagncctgmccccct,^«aacccag.Bc.cct«>. 18540 
gttaacatcttaca.aac.acagaacaattg.caaa.c,aagaa.caacctgggcacaat 18600 

.c.a.uac.aaac..ca.aa.ct^ca.a.ctcacca.t.ct.c»..c.ccccm 18660 
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tctcttCQagtgttcaatccggaatcctacattatatttagttgtcatttctctttggtg 18720 
tcttccaatc tgtgacagtt cctcagtctt tctttgtctt tcatgacttt cattttttta 18780 

tacttttgaaaaatactggccggttgttttgtagaacgccctcagtttgggtttgcctga 18840 
agttttttgt gattagatcg aggtcatgca ttattggaga gggtgccacc gcctcgatgt 18900 

gcaagctcaatgcatcatatcagagggtttgtaatgtcagtttataccgccggagaccct 18960 
aacctggagc atttcgtgaa ggtgctgtct gccaggattc tccactagaa agttactatt 19020 
tttccctttt taattactga atgtctgagg ggaaatactt tgagactatg caaatatcct 19080 
gtttctgctt taacttcggc tcactaagtt tagcattcat ctatggatct cgcttatagc 19140 
aagtattact gtggagttct aatggtaatt ttctgtttct cteattcctt caacctttat 19200 
taatatgcttcttcctcacttattcattttgtttcagttgtttataccaacatggatttg 19260 
tggatattggttttattctttgggttgcaattgaatcctatcattattttgttagtcagt 19320 
tgttccatccgaa;ttggtcatta|gagcccttgaaamggctcccatgccttttt^ 19380 
tttttttgag accgagtctc actctgtcac ccaggtttga gtgcagtggc atgatcttgg 19440 

cttcctgcaacctccgcctcccaggttcaagcaattctcctgcctcagcctc^^^ 19500 
ctggtattataggcgctccaccaccttgcccggctaattttttgtatttttagtagagat 19560 
ggggttttat tatgttggcc aggctggtct caaactcctg acctcaggtg atctgcccgc 19620 
ctcggcctcc caaagtgctg ggactacagg cgtgagccac cacacctggc ctcctatgcc 19680 
attttaacatgcccgtcmtcttmc«tcctactttctgtgactgtaagaagctcca 19740 
ggatacattt ttgctgccct agacttagcc tcaatcagtt ctcagaaaag ctctggttct 19800 

ttttatgggatacttagaaaactagctctgtatggcctggcgcggtggctcacgcctgta 19860 
atcccagtac tttgggaggc cgaggtgggc agatcacaga tcacgaagtc aggagatcaa 19920 

gaccatcctggctaacatggtgaaactctgtctctactaaacatacaaaaaattagtcca 19980 
eececsrtffg ceeecHcctg tagtcccaec tactcaggag actgaegcae gagaacggca 20040 
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tgaacccggg aggcggagct tgcagtgagc cgagatcggc agccactgca ctccagcctg 20100 
ggccacagag cgagactccg tctcaaaaaa aaaaaaagga aaaagaaaaa agaaaactag 20160 
ctctgtatgc tagttttttt tttaagacag ggtctctctt gccccagctg gagtgtagca 20220 
gcacgatcac agctcactgt agcctcaacc ttctgggctc aagcaatcct cxtgcctcag 20280 
tctcctaagt agctgggtct acaggcatgc accaccgtac gtggcaattt ttaaaaactg 20340 
tttgtagaga tggagtctcc ctatgttgcc tggtctggaa ctcctggcct caagtgatcc 20400 
tcctgcctcg gcctcccaaa gtgctgagat tacaggcatg agccactgta cctggcctgg 20460 
ccaaggtctgtcmttttaaaagaagttgttgtatagtt jgtttttttttte 20520 
ttctgagacg gagtctcgct ctgtcgccca ggctggagtg cagtggtgcg atctcggctc 20580 
actgcaagct ccgcctccca ggttcacgcc attctcctgc ctcagcctcc cgagtagctg 20640 
ggcctacagg cgcccgctac cacgcccggc taattttttg catttttagt agagacgggg 20700 
tttcaccgtg ttagccagga tggtctcgat ctcctgacct cgtgatccgc ccgcctcggc 20760 
ctcxcaaagt gctgggatta caggcgtgag ccaccgcgcc cggcctgttg tatagttttt 20820 
atctcgagtt ttctagcgat ttaatcatat tggttacaaa aaaggatgat tttactacct 20880 
cjctttccaat gtttctacat attttttcat tttatctaac tgcattttaa aataaacttt 20940 
taattttaga atggtttcat atttacagaa aatgtgcaaa gatagtacag agagttcctg 21000 
tgtactccac acccggtttc cttattatta tcttaacgtg atacacaatt aataaaccag 21060 
taacattatt attcactgaa gtccacactt tctttttttt tttttctgag acggagtcta 21 120 
cttctgtcac ccaggctgga gtgcagtggc gcaatctcgg ctcactgcaa cctccacctc 21180 
ctgggttcag gcaattctgt ggctcagcat cccaagtagc tgggaataca ggtgcccgcc 21240 
accacgcccg gctaattttt tgtattttta gtagagatgg ggtttcacca tgttagccag 21300 
gatggtcttg aactcctgac ctcgtgatct gcctgcctca gcctcccaaa gtgctgggat 21360 
tacaggcgtg agccaccgcg cccggegtcc atactttctt tagajtatcct tcctttttac 21420 
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ctaacgtcct tcttctggtt caggatccca tccagaaagc aacattaccc ctcgccatca 21480 
cgtcttcaca ggctcccctt gacgggaaga gttcctcaga ctttccttgt ttttgttgac 21540 
cttgacagtt ttgaggagga ctggtatctt agtctgtttt gtgctgctat cacagactag 21600 
ctgagaccga tacatgatac atgaaaaaaa atgtattctt acagttgtgg aggctgggaa 21660 
gttcaagacg aagttgctgg ttggtttggt ctctggtttc aagatggcgc cttgctgctg 21720 
catcctctgg agaagaagaa tgcggtgtcc tctcactgca gaagatggaa gcgctaaaag 21780 
gaatgaactc cctttgccaa gccattttat aatgggcatt aatccacaaa ggatgaaacc 21840 
ctgagaaaca tcaagcttta aagcactggt tctcaacctt tttggtctca ggagcccttt 21900 
atactcttaa aacgttttga ggatcccaaa aaaaggcttc tacaggttcc atcttttaat 21960 
atttaccata tcaaaaatta aactgaaaaa attttaaatt atttattc^t ttaaaataac 22020 
aaggataaac ccattacatg ctaacataaa tcatgtattt tatgaaaaat agctatattt 22080 
atcaaaacaa aaattagtga gaagagtggc atgtataatt ttttttgttt attttttgtt 22140 
tttagatgga atcttattct gtcgcccagg ctggagtgca gtggtgtgat ctcggctcac 22200 
tgcaagctct gcctcccagg ttcacaccat tctcctgcct cagcctcctg agtagctggg 22260 
actgcaggtg cctgccacca cgcccggcta attttttgta tttttagtag agatggagtt 22320 
tcaccgtgtt agccaggatg gtcttgatct cctgaccttg tgatccaccc gcctcagcct 22380 
cccaaagtgc tgggattaca ggcttgagcc actgcgtctg gcctaaattt ttgtgaatgt 22440 
ctttaatgcc tgccttctca tatttgtttc tgcattcaag ttattgcaaa atgttgtgtt 22500 
ggttgaagtt tgtaaagaaa atgtggcctc atacagttgt gtagttggaa aggcaagagt 22560 
attttgattc tctcttcaaa caactatgga caacctgctg ttacaaaacc agaatgcaaa 22620 
aagttgtagt aaatacaggt taggtgtagt gtggaatctg aaagcatgtg aatgaacttt 22680 
ctgagttttg taacattaaa gtccagttgc gttaagctac tgtgatagca tatagcattg 22740 
tcctaatact ggaattagta tcagaagtgg ggtgctactg ttaataaata aaaagaaata 22800 
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aataaatcat gtgatactgg ctcagaagtc aggcagtagg ctgtgtggaa cctgacatca 22860 
cgccatgtaa tacattggca accatttgat ccagctgtct gtcatgatga cttggaaagt 22920 
caaccacata cttacagagc ctgtagacat aggggaaaat agtataaaac agaatactaa 22980 
cagtggacct tggttcttgc cagttgcatt tagccaaata ttaaacaaaa gagatattct 23040 
tgggcagcaa ctggaccatc ttcaagtaaa agtgaaaggt aataaacaga gtccagacat 23100 
ttgtgcccat gcgggttaag aaaaatccag ttgcttctag acaccgtata tgaaaacaac 23160 
gctgaaaaca agcctttgag tggtaaaggc cgattaacac tcagcgcggt aacaaagacc 23220 
aggtgggcta acccgaaatg aaatgagaag cctgtggtga tgaggaggca gagaagtaaa 23280 
atcaagtttg agcatttcgt ttaggagagt ttgggctctg attacttgca catgcaaacg 23340 
aactggaaacaaacagatcagalgtctaccacttcttcgagggaattgcattgccaaaga 23400 
agtcatgaaa gcagactcta tactgattag gcattaaaac aaaaacaatc tttaggcccc 23460 
taaacttgcatgggcaggaagtgggctgtcaaagctgttcatcctctaaggtggacctag 23520 
ttcctagtcc ccagtataca cttcagatgt ggccctggag gacactggac atggaggacc 23580 
tcccagagga tgaggctagg gcttcatttc tccaatgacc tcagctgcct ctatttcccc 23640 
ttcttcctct ggaagtccta tcatcgttat tattattatt atcatcattt ttattttgag 23700 
ataaggtctcgctctgttgcccaggctggagtgcagtgacatgatcatggctcactgcag 23760 

ccctcccagg ctcaagtgat cctcctgcct cagcctcctg agtagctggg agtacaggca 23820 
catgccacca tgcttggcta tttttttttt cagtagagat agggctctca ctatgttgcc 23880 
agggctgatc tcaacctcct gggttcaaga gatcctccta cctcagctcc tgagtagctg 23940 
ggattcgggt gcacaccacc atgccaacta atttttaatt tttttttgta tggacaggat 24000 
gtacagtgtt agaaatggat tgcttgcaga ggcaggagga tcacttgagc ccaggagttt 24060 
gatcacactg tgaaccatga tcgcacccct gcactccaat ctgggcaaca gagtgagacc 24120 
ttetctcaaa aaaaaaaaaa aaeaaagaea eaeaeaeact caaaeatagg caaaaaaetg 24180 
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ggaaagctttatagtggacaaaaaggaacgctctaagtctgccctattggcatggtgctg 24240 
aaggtgggct aactagagat agggggtact atgtggttga ctatgggtgc atctttggct 24300 
ttccctgggt gatcctaagt tggaagcagg gacaaaaatt agggaagctg ttagttattc 24360 
atcacgttct ggcagtagtg gactggttgt gatagaagtt attgttttgg ccaggtgcgg 24420 
tggctcatgc ctgtaatcct agccctttca gagttcaacg tgggtggatc aggaaggagg 24480 
gaggatttgg gaggtcagga gttagcctgg ctaacctggc gaaatcccat ctctactaaa 24540 
aatacaaaaa ttagctgggc gtggtggtgc atgcctataa tcccagctac tcgggacgct 24600 
gaggcaggag aatcagttga acctggggag gcggaggttg cagtgagcca agatcgtgcc 24660 
caatttcatctcaaaaaaaaaaaaaaagttatcgtttagcttcctcgattgttactggac 24720 
gtagtaatct ggcttcctgc aagtctaact ttcagcagac tggctacatg ggctgtgtac 24780 

tgtagataag gcagtaagta aagcaaaaat tgatagagca tcaaggataa atagaaaatc 24840 
cgtaatcaag cagaagattt gaacacttca ctttcagtaa ctgataaaac aagtagacaa 24900 

aaaaaatcagtaaggatgtagaagatttgaacaacgtaattaacaaacttgacttgattt 24960 
acacgtctagaaccctgcagaacacacactttttcaagcatactcagaacatttatataa 25020 
agtgaccata tggtggacca taaagcagtt tcaacaaatc tcacaggagt aaaataacag 25080 
accgtgtttt ctgaccgtaa gtacagttaa cctagaaatt gaaaacaaaa agctagaaaa 25140 
accccatgtatctggaaattttaatatacactttgaaataacaaatggatcagagattaa 25200 
ttcaaataggaatttagaaataccttgaactgaaaaataatgagaatactataccccaaa 25260 
actgtggggt gcagctgaac agtatataga cgaaaagtat actcatatgt gcataccfla 25320 
aggagcgggg aggattgaaa gttaatggga ggcaaaagca ggtggatcac ttgaggttag 25380 
gagttcaaga tcagcctggc taacagggtg aaaccccatc tctactaaaa atacaaaaaa 25440 
ttatccaggc gtagtgaggc tgaggcaaga gaatcgttgg aacccaggag gcagaggttg 25500 
cagtgagccg cgattgcgcc actgcacccc agcctgggag acagagcgag actccatctc 25560 
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„^ aa^ aaaaggccag gcgcggtggc tcatgccg. aatcccagoa 25620 
«gggaggccgaggtggSCgga..acgagg«aggaga.gagac..oCggc.ago 256S0 

.cggtgaaa. cccgcCOa cmaaatac aaaaaaatta gc« gtggcSS^B ^5740 
ectg^g^ccagctactoaggaggctgaggcaggagaatg^gaacccaggaggoag 25800 

agc«gcag.gagccgaga.ogegccactg„.gggcaacagagagagac._, 2^^^^ 
.„c.caaaaaaaaaaaaaag«aatgggamaaca.cca.o.aagaag«agaaagga 25920 
..gacaaauaaccaaaaaaaaaaaaa«aaaagaa.a.a.camgg.caagac«.a 25980 
aagagag.ggctggg.gcag„c«g^«.cagca««gggaagcagagg 26040 
^,gcaga.cac«gagcccaggag«caagaccagcc.gagtaaca.agagagaccca 26100 
„aaaataaaaa.aaaaaa«>gccaggca.ggtgg.act^8^8Saggat 26160 
eacugagcc Uggagg«g agg«gcag. aagccatga. .g.gccac.g cacncagcc 26220 
.ggg^gacag ag^ggaccc tgtccaaa aaacaaaat aaggc«ggc gcgg.gg«c 26280 
uccaccac. ngggaggcc aaggCgagg tcagcagtt. gagaacagC 26340 
.,gccaacaaga.gaaacetoatc«actaaaaa.acaaaaaa«agnggg.gtgg.E 
glg.gcc.gua,cccagc»c«aggaggB^tn..ga«a,attnc««ce 26460 
.aog.cg«a«gga«gaat.cagaatga«ac.«ca,«gagc.cttcc«.«cct 26520 
aac.cagtggc«ccgaccccac.ctggt«cac„caccc«ctg«g«ca.cga 26580 
pagata«cctt«aatt«cacttgOgc««ccttaaccccccccgttggtgt 26640 
,,^«,c«t.acgcgacacctgcgttctc«gccct«tatca.ccctt 26700 

««gaggcggKcmcc.t««:agc.«^.accttcttc«gttK«ttgggK 
.^a^KtcaccCccc^aammcctctdccgcacccaJcaagc 26820 
^gtSgatctc^cctctactotcgggtocccccca^ccc^^mmcttc 26880 
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<210> 7 
<211> 29430 
<212> DNA 
<213> Homo sapiens 



<220> 

<221> unsure 

<222> (4336).(4345).(4349),(4392),(4447).(4490) 

<223 > Identity of nucleotide sequences at the above locations are untnown. 
<400> 7 

aggggaaggg ccggctccgt agctcacacc tataatccca gcactttccg aggagagagg 60 

atcatctcaggccaggagttcaagaccagcctgggcaacacagcaagaccgcatctctac 120 
aaaaacttct tttaaagctt aaaaaaaaaa aaaaaagcaa agaggacagt tcaggagaaa 180 
agcc^ga ggcagcacac taaggaggag acgcagccca ggcaccagga ggggctggcc 240 
atgggcactc actcctccag caggcgagtg cccagcacca gctggcccac ccagacaccc 300 

aggacacggcctgaatggctccgtattcacgtgggtggtaataaacaagcaatacacata 360 
gccaataagg acaccttagt aatgttacat cataaacgct gcagatcagg gaaatggtgc 420 

agggtgaagtgggttggggggctgcatgctacatgagaagtgggtcggggggctgcatgc 480 
tacctgagac agagcaggcc ttgctgggaa agaaggagcc ggcaggcctg ggcaaaggtc 540 

ctggggtgggagcacactggagcagagtgtgggggtagcatggcgggtgctggtcctctg 600 
cececcttcc caccacgtca tgtgcccate tgcccaaggt ctctcgtttc acagccccct 660 
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gaagctcagg ggtcacagct acacagcccc cagatacctt ggcctgcccc aggtcattcc 720 
atccagtgat ggacctgctg acctctagcc tgacctctgg gcagcgtaat ttgagaagga 780 
ggagaaggga gggcaacaga cctggggcga tgagggatgc acagggtggc agacacctga 840 
ggctgcacct tggagcctca gttctgggtg tgggtggggg atggacaggc tgagggctga 900 
agcagctgggcccggccaccatcacaccccaggacccaccagatcaccatgaaaaaccga 960 . 

atgtcaactg gcagcccaga gtgcagaaca aacctttcag aaacacggtg gtgactgccg 1020 
catcatgaac ataaaataat tacgccctct ccccagggat cacccctgca ggagtttgtc 1080 
ccaagaaaca ccagaaagaa ggaaaacgtc tgagtcacaa tatttgctga ggccttattt 1140 
gtaatagcaa aaaaaaaaaa aaaaaaagaa caatctccag cggcaggggt aactagacta 1200 
ttgtctccgt ggaaaggtag caccaattaa ctagtaacaa aatgactgcg gtaacaacaa 1260 
aacgttcgac atgtcaacac caaaaaccac acacccagca taaccgtgaa ccatgatttc 1320 
tactagaatg aatggcagtt atgagaaagc accagcggag acaaagattg aaaaagtaaa 1380 
ggtggcctca ttagggagac aagtctctgg gtaatatatt gtaatactgg taaatatata 1440 
gtttttaata tattttttaa ttccaaattc catatatgtt cctatgaagc tatttctgca 1500 
aatatttttt tcaggaccgt acatcacaaa ggcaaaaggg ccaggtcagc tctccagctg 1560 
agagtgacca cttcagagca gacggcagac tccagggtta gcaagcctgg ctgagacctg 1620 
gcccatgaca atcactcaac ccctctgacc tcaacatcct gtctgtgaaa tggggataat 1680 
tactgcacct ccacatcaca gagtgcgagg cttaaacagg atgcttcata gaaaagcgct 1740 
caagaggtaacagccgggagggggtagtggttttcattaattaaatgttgccttcatcca 1800 

gccctgggcc agctccaaca caaagcacac accatccact cagactcagt tgcctggatt 1860 
caaagcccgg cctggcctcc agctgtgaga ttccgggcag gatttcccat ctcccagagc 1920 
ctcagtttcc tcattcatga aacaggaagt gatcattcct tttattttta tttttatttt 1980 
tatttteaea ceeastttca ctctaettec ccasecteea etataateec ecaatctcag 2040 
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ctcactgcaa cctcggcctc ccagtttcaa gcgattctcc cacctcagtc tcctgagtag 2100 
ctgggattac aggcacacgc caccacgccc agctaatttt gtatttttag tagagacggg 2160 
gttttgccat gttggtcagg ctggtctcga actcctgacc tcaggtgatc cgcccgcctt 2220 
ggcatcccaa agtgctggga ttacaggtgt gagccaccaa gcccagttga caactgcttt 2280 
taaagacacc tctggctgct gtggaaaaca gcctggtagt gccteaaaaa gttacacata 2340 
gaatgatcct atgaccagta attccactcc tacatatata cccaaaagaa ctgaacccct 2400 
ctactcatgt atgtacacat acaggtacac gcatgttaac agcagtgttc acaaagccaa 2460 
aacatggaaa cagctcaaat gtccataacc gatgaacgga taaatgaaac gtagtctatt 2520 
caccacctga cggaggtgag aggggccata aaaaggaatg atgcataaaa acgaatatta 2580 
tggccaggta tggtggctca cgcctgtaat cccaggactt tgggaggctg aggcgggcgg 2640 
atcacgaggt aaggagttcg agaccagcct ggccaacacg gtgaaacccc atctctacta 2700 
aaaatacaca aattagctgg gcatggtgga gggcgcctgt aataccagct actccggagg 2760 
ctgaggcaag agaatccctt gaacctggga aacagaggtt gcagtgagct gagattgcac 2820 
cactgcactc cagcctgggc gacagaccaa aactccgttt cggaaaaaaa agaaaaaatt 2880 
agccaggtgt ggtggcgggt gggtccctgt aatcccagct ctacttggga tactgaggca 2940 
ggagaaccac ttgaacccgg gaggtggagg tagcggtgag ctgagattgt gccactgcgc 3000 
tccagcctgt gtgacagaag gagactctgt ctctaaaaaa caaaaacaaa aaag^cccga 3060 
cgcggtgtct tacacctgta atgccaacac tttgggaagc caaggcaggc agatcatctg 3 120 
aggtcaggag tttgagagca gcctgggcaa cacggtgaaa ccccatctct actaaaaata 3 180 
cagaaattag ccaggtgtgg tggcacatgc ctgtaatccc agctactcgg gaggctgagg 3240 
caggagaatc gcttgaaccc aggaagcgga ggttgcagtg agccgacaft gcaccattat 3300 

actCCagCCt gggtgacaga gtgagattCt gtCtCaaaaa aaaaaaaag^^ aaaaaaaa^a 3360 

ctaaacaaaa gcaaaaaaac caateaetaa tgttgtcaag tgaacttcat cccaatgsga 3420 
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atgcagataa tttgtttaaa aggcaccatg cacactgggc aggctggctt cccctgggaa 3480 
cgtcttcttt tgcctggatt cccagttggt ttaatcgggc gtagaacact ttcttcaatc 3540 
cgggattcag gcacccctgc tcagcacaaa ctcagtacac cccgcactct gctgtgggtt 3600 
cttggcacta ttaggagaat gtgagggggt gattcagatc tatctctagt gggtgcatgt 3660 
ctgccactcc caggaacgcc cacttctggc aagtcagtgt cagagaaagg ccagctcgtg 3720 
gcccctcctg ccttgagtcc caggacccgt gatcagtcct acccggagca gaatcaggag 3780 
tttgaaaacccaagtgccaacaatctcattttaacccatgtaagcatatccaatatttat 3840 
atatagaattcataacagatgtctgggcttccattccaatagcctatattttaeactgtt 3900 

tatttacatg gttacaccaa acaagactca attcaaggta acccaatoct ttgctactat 3960 
accaaaataageaacattttcagtccatgccttatatatattcaccaagcattacactag 4020 
gcctccaact gctcatcgga gcaagctgca gcctggacac aagctagaga ttaatcagto 4080 
aggaatgatc ctgcgtccag tgccagcatg atggaagaga cagagaaaca gaagacatca 4140 
gggctccaga gtcaaggagc ctgcaggtta gttgggcagg atatacacac atacacacac 4200 
acacgcacac acaaaaccac ccaagaagaa aaggtgggat gaatgcatgg acaggtaatg 4260 
cctggagcct ggggatggat aagctgactg caggtggccc aggcaggctt cctggaggaa 4320 
gaagacctgg ctgtangtgg ggtangcaag ctttctaaat ggggaaaatc tggctgtggg 4380 
tggagttggc angtttccga aaagaagaaa agctgactat gggtacacct ggctgttggt 4440 
ggaacangca ggcttcttgg aagaagaaaa tctggctgtg ggtggatcan gcaagcttct 4500 
tggaagaagt aaacctgact atgggtggac caggcaggct tcctagagga agaagaccgg 4560 
ctgtgggtga accaggcagg cttcctagac agaggaagat ctggctgcgg ttagagtggg 4620 
caggcttcta agaagaggaa gggctgactg tgggtagacc tggctgtggg tagactgggc 4680 
aggcttcctg gaggaggaag agctggagca ttgaaaaaca aacatgactt ggtgaatgtt 4740 
sagcatgccc aggcctgatc cccagaggca attacgcact caagttactt aattctactc 4800 



wo 01/92891 

74 

acaatgcctc acaaacaact tctctgacac ctaacacagc tctgggcacc ttctagcttc 4860 
agctcctcaa agcagttatt cacgctacta ccctgcacac ctcctcacac cccaacccca 4920 
gggacaggag ttctgccaga tgccaaagct cctgatgcca aagcctgggt ctgcttccgg . 4980 
gctcctcttg gtctaactgt ccaccccgca tcggcatgat gtgcaaaaac aaggctttgc 5040 
aatctgccct gatgcctggc ggagcgagtc cctcccgatt cgtctccttc agaaacacct 5 100 
gggctgccct ggtcctgtta tacccccaac acattctaca gtcagctccg caagttccac 5160 
aaagatcaac gctggcgttt ttatggcatt ttatttacag tttttacaat ataaaaaagg 5220 
aaggatgcca cagctcagcc agcaggacag acagagatct atgatgcttc tgctgcacca 5280 
ttgtttgtgg tcaagaaagt ctgttttcaa tgatttatta aattgtggtg ggagatggat 5340 
ggtggcagtg gttaccagca acatgaatgt tcttaatgcc actgaacttc acacttacaa 5400 
atggttacga cgataagtgt tatatgtatt ttaccacaat taaaaacagg taaatgcagg 5460 
ccgggcacgg tggctcacga ctgtaatctc agcactttgg gaggccaagg caggcagatc 5520 
acctgaggtc aggggttcga gaccagtctc gccaacacgg tgaaactctg tctctattaa 5580 
aaatacaaaa attagccaga tgtggtggtg catgcctgta atcccagctt ctcaggaggc 5640 
tgaggcagga aaatagcttg aaaccgggag gcagaggttg ccatgagctg agattglacc 5700 
attgcactcc agcctgggtg acaaaagcaa aactctgtct caaaaaaata aaataaaata 5750 
aaaataggta aatgcaaaca tatggtatag taatattatg ggctatiatg agctacaaaa 5820 
aagaatgact tgggactaca gttacagccc tcattcagga atttgtttta aatgtgggtt 5880 
ggtcgctaag gcatgtacac aacattttga cgttcaaata ttcctagatt tggacagtga 5940 
gcacccctct aagctggctc ttctgtccca gaggtcccca ccagtcctcc agaacttctt 6000 
tgctttctta cacaataaga tgccccatgc tcggcttgta cctttccttg ccccagccct 6060 
agaaccagct tcttcgtgga caagctctga ctcctttggg tggagaatgg tattcagaaa 6120 
cccagacctg eectctggtg tgctcactgc tacttggggt cattgcttct aggcctctct 6180 
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gctgatggag gtaggatata cacgtacagt cttccctctt cccagattcc gtacttgagc 6240 
tcgcctactt gctaacattt atttatatcc cccaaattaa acctcacagc acttctgcaa 6300 
tcactcactg acttgcagag tgtgaaaaaa ctgagtcacc atcacacgtt ccaaactgag 6360 
gtcaactgag gccacaacgc cccatcttct tgctccggct gtcgagatgt aagcaagtgt 6420 
ccttctctcg gtctagctag tgccatgctt tccacatcac tgtgcttttt gtgggcaatt 6480 
ttgctgtata aaatgtcccc tgcacatatg ctgctgtgta gtgctcctag gtgcatgagg 6540 
ctgccccacg ccttacagag agaatatgca tgagaggctt tattcaggta tgagttatag 6600 
cgtagttggc catgaattca atgttaatga atcaacaata tacagtaaat aaggtgcttt 6660 
■ ttagagacag ggtctcactc tgtcacccag gctttagagt ccagtggtgt gaccttggct 6720 
cactgccgcc tcaacctcct gggctcaagt gatcctccca cctcagcctc ccaaactgtt 6780 
gggattacag gcgtgagcta ctgcactcag cctaaataag gtgtcttaga aacacacata 6840 
agacaaggtt atgggctgag tgcggtggct catgcctgta atcccaacac tttgggaggc 6900 
caaggtggga ggttcacttg aggccagaag tttgagacta gcctgggcaa catggcaaga 6960 
cctcatctgt atattttttt aaatcagaca ggtgtggtgg tgcatgccta tagtcccagc 7020 
tactggagag gctgaggcag gaaaatggcc tgagcccagg aggtcaaggc tgcagtgacc 7080 
catgattgta ccactgcatt ccagcctggg gtgacacagc aagacgctgt cttaaaaaaa 7 140 
p^^aaaaaaa aagccaggtc aggtatcgaa cagttggcaa aaacgttgtg acctgaggct 7200 
cacaggaacc tagcccgatg tttcccctag gagcaatggt tcagtattca ataattcagg 7260 
gttcccagtg actttatgga gcataacttt caagaataac aagaaccaac tgtacgtgtg 7320 
tatgtatact cacactttta ttttatttta ttttattttt tgagacagag tctcactctg 7380 
tcacccaggc tggagtaaaa tggcgtgatc tcgactcact gcaacctccg cctcccaggt 7440 
tcaagtgatt ctcagcctcx; caagtagctg ggattacagg tgtgccccca caaccggcta 7500 
atttctetat ttttagtaea Eacesaettt ceccacattg eccacectee tctcaaactc 7560 
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ctaacctcaa gtgatccacc cacctcagcc tcccaaagtg ctggaattac aggcatgagc 7620 
tgccgtgcct agcctacata cacttttata cacacatgca tctatgacta tttctctatt 7680 
tctgtgcatg tgtgcgtggc agtacctaca gtttcagcta tgtgtctggg tactgtctcg 7740 
tccaagtttg taagcacctt ctccaaagtg caaagcctgg cttgtgttac tatccatatg 7800 
tttacttatt tgctcaatca atttacttat tagctccata accagcttcc catctgctcc 7860 
agtagcctct gctgtcagtc acctctgcac cctaccccac cttgcttccg gatgctggat 7920 
gccaatcacc cccgacacct ctacatagca ccaccctcga catgctgctt ctttatttct 7980 
tatttatttg tttgagalgg agtcttactc tgttgcccag gctggagtgc agtggcacga 8040 
tccaggctca ctgcaacgtc cgcctcctgg gttcaagtga ttctcctgcc tcagcttctc 8 100 
aaatagctgg gattacaggt gcccaccacc acgcccagct aatttttgta tttttagtag 8160 
agatggggtt tcaccatgtt ggccaggctg gtctogaact cctgacctca agtgatccac 8220 
cttggcctct caaagtgctg ggattacagg tgtgagccac cgcgcctggt ctgcttcttt 8280 
aaatgccagg caccaacatt tgtgcaatgg ggtgggagga aagaacaggg aggagagcac 8340 
actgccggcc cctgcactga atccactgat caatctgggg gcaactgcca tctccatctc 8400 . 
ctgtcttcct atccgtgaac atctactgca gtc?ctctcca atgtccttct gtaaagttgt 8460 
attatgtttt gcatacaggc cttgcatatt agttctcaga tataatccat atactttata 8520 
taaaattcaa accacattta aaaaaataaa actagcatga ctataacgga gtctgcaaca 8580 
ttctoacaga ctttatgata aaacatgaaa cttcaaagat acttagggtg gggcagggac 8640 
aatgtttaag gctgcctgga agcctcccca tccctgagcc agaaagtcct atctcccctt 8700 
caaggggaaa tgcttgaaaa agcactgatc aggctaaaat gacagggatc agggagtaat 8760 
caaagtacaa gtgagctggt ctcctccatt ctgagcacag caaagttcag tctctccaag 8820 
tccaagaatc atacacctgt ttgccaagaa tgaagttcag gtgtctacaa gtggctgaaa 8880 
atattcatts ctsssccatt aacaacattc ttsecaaaac cataccttae cttctcetse 8940 
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aaatttctta aggtagaaga aacaggaaac acccaggctc gcttttatgt agacagttcc 9000 
atgaagccag ggaccttccc cacatccacg tttcaattac ctgcacgcag ctcacagtgt 9060 
attcaacatc tacgcgtctc tcctactggg gtggcggtgg ccactcaaac cctcatgcag 9120 
ctacgatgac cgcaattttg gcaacataat ttcatgtttt tccttgggct tttacccaag 9180 
tcagtgacac aattctgcag ttgtctaaag attcaaaatg agggacttga catttacaac 9240 
aataataaaa tcttgggttt cctttaacca agcacatgtt ctgcctttta gagaaagctc 9300 
tgcaaactca agctggagtg ggatacttgc tgacatcttc aagcacccca ggaatagctc 9360 
tactccccca tttccacctt ggctgaacca tctatatccc accaattccc ccaacatccc 9420 
tccatccgtc catccatcca cccaaggacc tgctaagcca ggaggtctct cxcatctacc 9480 
ccacagcctg gcctcagccc acaagggctc tctctacatg aatcccaccg caccagagta 9540 
gaccaagtct cccgtagact ccaccctgac cacctccatg cctccagcca ttcccacccc 9600 
taaaaaccct ccctggtctc tacacccagc tgatgaatac ttggctgaat gtgacctggc 9660 
ctcctggacc caggtgaagc ccacgtcctc cgtaagcccg ccagctcacc ctgcctctgc 9720 
accttcactg gagagagccc gcacttcacc tcctcagggc aggcatggct gatgccaccc .9780 
agtggaatct ggtgcaaagc agggcccggt gcagagcagg gctgcctgca gagcaaggcc 9840 
ctggtgctgg ggccgagcac ctccaatgct ggccgtggaa ccatccctcc cattccaggt 9900 
gctgtctcca tcaagaatga gcgagctgct gacatttgca tgacaataat gaataaatac 9960 
catattttgc ttcaaatcca gaatagatgt ggccagggtt ggcatatgac tgttgggaaa 10020 
ggacagtttg cctcttccca aaccaacttg gattataaaa agcttttctt aacgaccaca 10080 
agagcggagg agctcagggg cagacaaaag gaaggctggc tgcagaaggc gggagagtgg 10140 
ggccttcagg ggcgggtggg gagagagaaa gcctggagct gcacccccaa ggtctgtgta 10200 
caticaggtgc tacagaataa caccacctct tccagcttgg cccccacctg ccctctccca 10260 
Bcccafftcac ccaeacasca ccccactccc cacacacacc tcacatctsc ccecctcaca 10320 
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ctcaccagct tcggctctca atgcaacctg gaacctgccc ttggcctctc agctcagcca 10380 
cccccattcc tgttggcccc tggcccccca tcgaattctc tctaatccta atgcacacac 10440 
ttgcacactc aaacacacac acacacacac acacacacag cccagaggaa aaccataatt 10500 
gactgaggtc caggcaagtt tcccgagcag ggaccacatt tcaaaggtca gggaagcagg 10560 
cgaacaggaa acatacaggg ggcacgtttg ggggtggagc aggaaataag aaatcacttg 10620 
caaaagataa aaagaaaatg aggtagctgg tttcagacac ctcggagcac acagaacagg 10680 
acaggcgcct ccgggtcttc cctcaacagg gagatgggcc aggcaggtcc ctgctgctcc 10740 
accgcagagc tgggggctat ggccctgaca ccaaggccct ggggcaggcg gggaggcagc 10800 
tgttctcctg cctgtgctcc cgggcagggc ctggccccac aagggaactg gccgaaggct 10860 
ctgcttggct actccggaaa gtcctgggag acaagcaaag gacttgctag gtcactccaa 10920 
acggcccaga tgtgacaact gtgaagaagc cacaccaaag caaggtgaca gaacaatgtt 10980 
ggtgacgtca ggttatcagc ttacgctcaa ctccacttac ccggactcac ccgtaacctg 1 1040 
ccgtctcttc ccaaccagta aaggatgcct aggtagaggg gcacaaggcc tggagcataa 1 1 1 00 
ttaccatttt aaaggctctg agaagtcctg cggtgaggaa gcctagttca ctttotctcc 1 1 160 
cctaggattt cccaactgcg cctgatcaca gaacattttt tcatttecac tcaggaaaca 1 1220 
tattttgaaa aacactggcc tagaggcaga agtgaaatgg aaaacacaaa agtaaaactg 1 1280 
aacaggaggc actgggcaga gaacggtcag aggcgccctg aatcctggac cggtggagat 1 1340 
ccccagcttg gcatgctccc ctccctgggc ccagaccgcc tccccccatt tcctggataa 1 1400 
gaaggctaat gcgcatcagg gtgaagggct tgcctgggct acacccccag gctcgcccca 1 1460 
caccaatcgc gctcctgcga gagccagtga ctttcttgat ttggctactg tggaattgtt 1 1520 
tgcaactaac caccccagat acagatacaa atgacaggat gatcagatgt aaaggaccca 1 1580 
caggtctotg tgatacggct tcatgcagcc agcatggcta gtgccgtgca gaatgagaat 1 1640 
gaccccaggc aagtccttgc ctcccagacc cagaacccca tggagcccac cagggctggt 1 1700 
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tcacaagcac tgtctgggtc gggcagagat tccagcaaga ggagggaaca tccatgcacc 11760 
ggagccagtt accagaagca aatcgcctct tccaaaaccc aggctattaa tggagtccac 11820 
tgttgagtgg agctggggtc tagctatgga atactgcaca gcagagatct tcctgagaga 11880 
aagcagtttt ccctgaaagc catgtgtcct ccactaactg tgttttaatt gggcgaacgt 11940 
ctgtatctca ttgcagtggc cgcgcatgtg ctgacaaggg gctgggggcg gggtggggag 12000 
cagaagctca ggggcctggg agggaaggaa acaggccacc agggctcccc agaaggcatg 12060 
tatctctctc acaaacacac gcatgcacac acacgtgcac acatactctg caagccctga 12120 
gttagcaact gtggaatgtg accagctcag tgatcccagg acaagctgct agggaatatg 12180 
acatttgatt gatgtctgca aatgtgcgtt ttcactaatt agaaggttta gggcagagca 12240 
gagaaaaata tgtatttcag agtcccagtt tgacctgcca gaaaccagcc cattactaac 12300 
attcttattt tcaacaaaat atagcattct gattacatac catcttggtt ccacgcctcc 12360 
tgccttgcca agcccccgga agcggcccaa ggccatggca aatagtgaga gaaacagttc 12420 
cagggtggag actgactcag gggtgtcagt cagtggggcg ctgatggccg gtgggaggcc 12480 
agcagtcatc accctctcct tgggacagtt gagtagctct cccccagggt catgtggcca 12540 
ctcaggttca tatgggaggc gagaggagtg gcagagtcca ggagagtggc tccgaagtca 12600 
ctgttccctc caggcctcag tgtcttcatc cattaaatgg gtaggctgag gtctgggatg 12660 
acaaggaggg cttgcactta ctgaaaccca tgggaggctg ttcgccgatt tcttttattg 12720 
atggaagaaa acactcgtat aattcaagta ccaattaaaa ggcaggcact ggaaccaccg 12780 
tctgccaatt cctagttttg cctataccaa atttgagcaa gttaattgac ctctcccagc 12840 
ctcagtttct tcgtctgtaa aatgagggta gggatggccc ccagcccaca gggcagctgg 12900 
aaggattaaa gaaatcaaac atctcttaga gcccacctgg cacactgtga tacacaacaa 12960 
atgttagcta tttttgtcta tgaagtctag attttatatc ttgggtgttc taaagcagga 13020 
tacatttatt taaaaacaag gattttcatt aaacacgtac ^eeacagaca gcaaccccat 13080 
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ggagactgct cttaattcag gccagtatcg aaacgactct aactacaagc tttatacagg 13 140 
tctcttggct gtccttcaaa tccaactaag gtggtacttc tgaagcactg tgcacatgtg 13200 
tgtgtgcatg cacacgtgtg ggaagggcgg gctcacggat ccctcaggta ccccacccac 13260 
gcagtctcaa gtcacaaagc gacagagcag ccgaggaagg tctgtgcccc actggaccct 13320 
cgtgaagcca ccaactctac ctctgcgccg tgtcctgcag actgggctac cctttgggtg 13380 
gggaccagca tttgatgcaa gaaaggcaga cagaaaagga aaagggcaag ttcgactcca 13440 
gataacacag acagtaccaa gccccagggt ccataaatgc cacgcagatg gaagcattta 13500 
ctgcgaggcc acacagcaaa cgcacggatc cagggacgga ggtgcagact gcggtgcccc 13560 
tgagccatga ccctgcaaat taccaccatg ggaaaggagg ctgccaaacc ccccgacagt 13620 
cggctgggct ggcacagact cgtggtttoc atcgaggtgg gaggaggtgg gacgtcccag 13680 
cccctccccc atgcccactg cagagggaag cggccgtttc ccctgtgtgg ttacaaaggt 13740 
ctcattgttc ttcctcacag ggaggaaact ggaggaccga gctcagaacg cattttagaa 13800 
ctggcagaaa agaacatctg gggaaggaaa cacatttcag aaacaaacat acctttgtac 13860 
cagcttttat tttctttaag tgttgaaaaa ataataataa taaagacatg ccaaatttat 13920 
catcgctcta caaaatccct ttattgagca aaacgtggca gctctacttt caaatgatta 13980 
ctgttcctgg aaaattgcag caacgtggat gccaaggccc gaaggccgcc atcagcagcc 14040 
aaacaaaaga tgccacctcg ggctccgcga cactgtacca tgccagggaa ctggacagat 14100 
ttggggaatg ccacggtttg cctttaaccc cttgcctcct ggtctcctga tgcatctcag 14160 
aggctaacat tctttgagga actggcattt cttagttgta aatatgcatg tgggtttggg 14220 
agctgcctgc aaagtccagt gttgacgaic agctttgatt tccttggaat caagtttacg 14280 
tgtcgagtct ggaagttaag aagaatttgg agaagctgag cactatggtg ttgcaggccc 14340 
tgggtgaact cttccaccaa gcattcattg tggactgaca gcgtgcgagg ggctctgcag 14400 
ffcaeetccac asaaceaaac acattccetc ceegeeaaac ctecaseaaa ectccctctt 14460 
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cttcctaagg tgccgggcct agcttcatgg gtccctaccc tccacgcctg tcacactttc 14520 
tgagtctcat gtgggagctg cttctggttc ctgacttcac tcagtcctca taggaggtgg 14580 
aactactgtc accccatttt acagatgggg agactgggca caaggggacc aagaaaccaa 14640 
tgcaaagtca cacttgtggg atcagtgaca ggggagatca attcccaggt tctttctgca 14700 
agagttaaat tgttttcatg ctgcctaagg gggggcaact gaaagaccac tgcatatctt 14760 
tgccaaaagg gtcaagcaca ggagccgcag ccagtgggtc agatccgcag aggcgctggg 14820 
gtgaccctcc ccatacctgg agggatgctt gtcccctcct ggccttcact gggtcccctc 14880 
atgaccgtgg cctcccagga cctcagcaca atcccggtcc tgtgctccag gacaagccct 14940 
ccgtccccaa gactgtgagg aaatggaacg aagaggggct cgctgcagcc cagcacccac 15000 
actgcccctt ctcaggggca agaaccgtcc tggaggactt ggctttggag ggggagcctg 15060 
ggaggccagt aagtcaacaa gcctctactg ctcatgggtg ggatcccacc gcaggccccc 15120 
acctgctggg gcgggcaggg acgggcggca cagcttggcc agggcagata acccccacct 15180 
tggccagggc gaaggcagga cacgtgggct ccagcctggc cccaccatcc ctgcacaaca 15240 
ctgggcaaag tbcacgtttt cctcaactgg gtgttgacat ctgcaggaca ggggcatgga 15300 
ggtacagagc gctgaagcca cacagcaacc taggagcgag actccatgcc tccccgggga 15360 
cccctcccca ccatgaggac catgaaggct tcccatgtgc cgcaaggact ctggtgtgga 15420 
gacacacgtc tcctacacag ccaggcctaa cgctcttgta actgggtggt cccacctggg 15480 
ctcacagctg gagggccagg agctcaaggc ttcgcagggt ctgctctcat cccagaggcg 15540 
atggggagcc acagcaggct gcaggagaga gggtgggccc cctccacttc agaggcccca 15600 
tctggcccac agactggaga gcacatctct cagcaaccac ggagcgccaa ctgcgcacag 15660 
ggcctggtcg tcagagcggg gcaaaggcac tgaccgtcac ggccagggcg agggaagacg 15720 
ggtgggcagg gaccttgggc agagggggaa gaacctggtg cccaggctgg ccctgccttc 15780 
aecagtgaag ctgagtgggg aggcgctgat gcagggggcc agaaagggct gctggtcagc 15840 
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cgggaggagc cccccacaga ggaagcagcc agcccagacg cagatggcag ggtcccctca 15900 
acaatgtcct ctgaaaagga gaggcgggga ctgctctggt gacacctaca aatagatagt 15960 
cagccctcag ccccctgcca tacttctgac aaagcagagg cccccagggg aggcgcaccc 16020 
gaaggtacct gcacctgtcc cccagactcc tagagcccac ctgaccccat cccaccaggg 16080 
ctccagctac aaaataaatg ccgaggccag ctaggcaagg acgcacactc ggtaccgact 16140 
gaataggctc cacgttgtca tgagcgcaac ccacaggcca ccaggccaca ctatgcagag 16200 
ctgagatggt ttcggccaag cagcctctca gctgagctga acaagtcxag agtccccggg 16260 
gggtcgtcac tatggagtaa caattgcgat gcgatggtaa ccctaacagc taaccgtcac 16320 
tgagccaggc cctgagctag gtacttttca acgctgcctc tctgcagcct caggacgagc 16380 
ctgtgggagc ataaagatca ttccctatca cggatgggga aactgagctc tgaagcagtt 16440 
aacgtgcttg tcccagaccg cagagctagg agcaggacac aacagcaggt caggcaggaa 16500 
cgggtgaggg gggcctgcat gggcttctct ggaggctgcg catacacgca acccccagga 16560 
ccccgaccct gcacctgcag ctcgctactg ccccctcagt gactccagca aacctcgggg 16620 
taggggaagg aggctgggaa tacctcgggt gtccgaaaca gcagcttctg cttggaggcc 16680 
actgctgcat aatggttgct gcccagcaca ccccaagcca cctgtgccac ctgtggtgac 16740 

cttccagcatgccttggtgaccaagctggccttaggtgctgtgggcagccaagaatagaa 16800 
cagggcccac ccctcctctt cacactaaca caaagcaaga ggcgggcact tcgactgagt 16860 
gcatccctct agctcaaggg cctcacggat cacaggggtc agggcaagat cccaattctg 16920 
cattcccgtc tgcctttcat cctgctctgc caacaacagc cagtgaggct ggggacatcc 16980 
ctgaacctgtttctcacctgaaacacatcataccattggaccccagccctccgggagagg 17040 
ccctaatccc tgactgtggt gagatcagat cactggttaa gtacccagaa gggccttggt 17100 
caggggctcc aggggtgggg ggtgatgggc gtggtggtat cccgctctgg gctatagtcc 17160 
accctgatgg aggaggtctg tggtcagaac cgggctgtgc agggcacagg agcccagagg 17220 
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gacccccaga gctcacctgg tggtctctga gcagggctcc ctcaaccctc agagaaaagc 17280 
acagcaagga ggccgccxag agcccagcgc ctagcaccca gtggcgtgcc agacctgcct 17340 
ggatcctgga gatctctcat caccctccaa gtcagtcatg cccaacccag ggacccacag 17400 
cccacggggc cgtgaaggtg tgctgagtcc aagaaggcct tcgacactgg gaagccaagt 17460 
ggcacctcct ggtgtggagc aggcggaatc ccaccagcct ctgctctgcc agtgggcaca 17520 
gctggacgat gagcagaagg ggctgttgct taataaacgt catttcctta agaggataaa 17580 
acctttcaaa acagatggaa attttttttt aattaaaact ggtggccaaa gagatggaaa 17640 
gcaccccttg tgcctccctc ccatcgtgac ccatcctctg cacacctcaa gctgttcgct 17700 
gcccaggtgt ctcctgaggc actgggggcg ggtgagaatc cgtgagccct cggccagccg 17760 
' tggctctctg gagctctgcc ccaggccatc agggcacacg ccgggcaccc tgggggccac 17820 
acagggcaga gcccagctgg gtcagcacac agggccacac tgggcacaca agtctctgag 17880 
^ cctcccctgt ggacgcagct ctcactatcc caccccacta ggtcccgggg atctgtccca 17940 
cagggtgata tgctgtcaca gaccactacc agagccatgg cctgctgttc cgcccgcagc 18000 
caggtagtca cttgctccac agggacaggc aacgccgcac ttgggggctg ctctgcggca 18060 
ggactagagc tccagcagct cagccctcct gagaaggaga actccatgct ctaagaggca 18120 
gacgcagcgg acggcaccaa agccaccaca agcccacggg gccctgcatg gcaggtcagg 18180 
agtccctgac cactcgctct ttgtaaccag agctgcagtg gagtctacga ggcaaggact 18240 
gtgggcggca gtggccacag caaatgaatg agtgtcccaa gggagcaggc ggctgcgggg 18300 
aggcacagcc gggacccagg agtcctccgg cactgcagca aactccctgg gccccctgag 18360 
cagcgaccag gtggcaagtg catgaactcc cgggggcata acctgggagg gtgacactct .18420 
cttcgtgttc aaattcttga gaacgcatta aaaatatcac tcagtcacct actctatagt 1 8480 
tttaactcaa aagtaccaaa gtagccaggc gcggtggctc acgcctataa tcccagtact 18540 
ttgggaagct gaggcaagag gatcacttaa gcccaggagt tccaaatgaa cctgggcaac 18600 
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atggagggac cccatttcta caaaaaaagt gttttaaaaa attacctggg cctggtggtg 18660 
tgtgcctgta gtcccagcta ctcaggaggc tgaggcggga gaaccacatg aacccagggg 18720 
aggtagaggc tgcagtaggc tgtgatggca ccactgcact ccagcctggg taacagagtc 18780 
agactctatc tcaaaataaa tttaaaaagc accaagccag gcttggtggc tcacacctgt 18840 
aatcccagca ctcagggagg ctgaggcaag tggatcacct gagtcagaag ttcgagacca 18900 
gcccagccaa catggtgaaa ctccatctcc actaaaaata caaaaattac ccaggcgtgg 18960 
tggcgggtgc ctgtaatccc agctactcag gaagctgagg caggagaact gcttgaaccc 19020 
aggaggcaga ggttgcagtg agccaagact gtgctactgc actcaagcct gggagacaga 19080 
acgagactcc atctcaaaaa ataaataaat caatcaaaac caccaagact ttttaatata 19140 
aacatttatt attccataat tccttttttg catgattaaa aatgtttata taaagtttcc 19200 
tgaaaatggt aagaatgcca agtgaaggct gcaaatgccc aagcxcccac cgtggcatct 19260 
cacggagtct gggccctagg aggctggtgg gtaccacgtg gacccgagac ttcacagtca 19320 
agtccctttg gggtacactg ggtttcccac accccagaaa tatgggctct tactgcagga 19380 
ccatgggggt cctcacactt ggcccagaag ctgtcacata gccagacagg tgttctacaa 19440 
cctaggctag agggagctca tgctccagcagaattcgagc cagaggaggt aaaagatggg 19500 
taagatctgc tccctggaca gatgaggcct tggcctcaga acagttactg atcatctacc 19560 
agacatcaca ctagaggcag aggggcgcag acgaagacag cccctgtcct caaggccctc 19620 
ccaggttggg tggaccatgg aaggttccag acagatctgg caagagaagt gcccacacca 19680 
ggggcagaag atgggcaggt ctgctcaggg cggcacggcc tgccaggcca aaaagttcca 19740 
acttcagatg ctggagaatg ggcacgactg tctgagaaag ggaaggatgt gatgaaaact 19800 
acttggagaa aaattaatct ggccagagca taagataaat gggcaaaggg gaggttccag 19860 
aaagcaagga gaccaagtaa aagctgatgt cattggctct gaatctaggc tttcactgaa 19920 
tatgcaccgc agggcctgta ggtaaagcct cagagcccag ggagtctgag tggaggagag 19980 
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ggcaggggac agagctgggg cctgtgtcta cagtgctcag gaggaatagg catggacgtc 20040 
agctcggagg ctccagctga agtgaggagg cggccagggc agcacggcca cgcccggatc 20100 
cagactcctt ttgggaagca agttcgctct gggggaaagt ttggagaaat ggcctttacc 20160 
cgcagaagca agccccagaa catatcttgc tccaaaacta tctcgtacag tgaggacgtt 20220 
aagcttcagg tcccctagag gagacagtct gctccttcct ggggcagaac ccaaggtggc 20280 
cagagcctgg aaggcaccca gcacccaggc tggtgtgttc cagcccaggc cacacgctca 20340 
gatagctattaatgccccgttgagcaatttcctgagagctttgccaggcaggtaccgcct 20400 
ccccatctga actaatacag gggtacatcc caaggaagaa atgaaaggtg cccacatttt 20460 
gctctgggat taactaggga ggggagtgat aattaactca gtaattatat ttgccatcgg 20520 
gctaatgcta aaattagtgt gcattagaat ttctttcctg agcagacacc ggagtgagtt 20580 
gggcagcagg agtggctcgg gcaagtcggc acaaagggca cctccagagc cttccacaaa 20640 
tgtcagcaaa acccacaaat gtcaaggccg gctccactgc acccagcaga tgaattcact 20700 
tccacagcct gagaccgcca gctcatcgga ggccatttaa aatccagccc tctgacacct 20760 
gctggatatc accatttacc gtccccagat caagagatca aagggtggaa cctgatagga 20820 
cggctctgaa gttcaccaca aaagcataaa cgtgcaagca gagccaatac gtcttttgaa 20880 
aaggacaatg aggtgggaat ttacataact gatcttaaaa tatgttctga tgcttcagag 20940 
atggagacag cagcattccg gtacacaaag acactcacag gcagtggagc acagtgaagg 21000 
gtctggaatc aggacccagg tgtctgtgga cactacacat aaaagagcag catttacaat 21060 
gaatggatag gatggaccat cccaccaagg tgttggacaa ctccctattc actggccaga 21 120 
cccctacctc ataccatata, caaaaaaaaa aaaaaaaaaa aaacccagac agaataatgt 21180 
ctgaatgtaa aacataaaac agtaacagtc ctggaagaaa ataatggagg atatatttat 21240 
aatctggaga tggagtaaca agggatagga aaaaagccat agggaaaaag tagagttatg 21300 
attatatgaa gcttcttaat atctttatga taatgtacca ccagaaacaa ggatgaagga 21360 



'^O0V92S91 PCT/USOl/16946 

86 



ctagctacag accagcagtg aaacctgaaa caaacagaac aaagaattaa agtccatacc 21420 
aaataaagac ctcccacaaa tctataagaa aaagataaac aggctggcac cgtggcttat 21480 
gtctgtaatc ccagcacttt gggaggcgga gatgggtagg tcacttgagg tcaggagttc 21540 
gagaccagcc tggccaacat ggtgaaaccc tgtctctacc aaaaatacaa aaattagcca 21600 
ggcgtggtgg cgcatgcctg tagtcccagc tacttgggag gctgagccag gagaacagct 21660 
ggaacccggg aggcagaggt tgcagtgaac caagatggca atcgcgccac tgcactccag 21720 
cctggaggac acagcgagac tctgtctcaa aaaaaaaaaa aaaagaagaa gaagaaaaaa 21780 
gaaaagaaaa agacaacaga aaaatgggcc aaggataagt gtaggcaatt tgcagaaaag 21840 
taaataccaa taaaccagaa atgagggttg tgcaaatcaa aaggtgttat aatttttaac 21900 
caaactggac caaagaaaac accaaaaacc aaaatcttgt aattgccagc atcagagagg 21960 
atataggaaa gtgtgtgttc tcgtagatgc ttgcaggtat gaactgctac agccttttag 22020 
gagttatgta tgtatgtatg cttgtatgta tgtatttgag acagggtctc gctctgttgc 22080 
ccaggctaga tctgttgcag tgctgtgatc atggcttact gcagccttga cctcctgagc 22140 
tcaatagatt ttcccacctc agcctttcaa gtagctgaga ctacaggagt gtgcaatcat 22200 
actcagctaa ttttttaaat tttttgtaga catggggggt ctcccaattt tgcccaggct 22260 
ggtctcgaac tcctggactc aagtgatcct cctgcctcaa cctcccaaag tgctgggatt 22320 
acctggatga gccactgtgc ccggcctcaa tatctttaaa aacagaaatg gacacactct 22380 
ttgactagga atgtatccta taaaaacact tatacacatg cagagacaca cgagcaagca 22440 
tgctttgtaa tagcaatgaa ggctggaaaa actcctcaat caggtaaatg ctgtcaagtg 22500 
cacctgtgta ctatgaaatg gcacttggct tttaacaaga gcaaagacag aaaagcaaaa 22560 
gtacaaagta gggtgtgatg gcacatgcct gcagtcccag ctactcagga ggctgaggca 22620 
ggaagatcct ttgagcccag gagttggagg ccaggagctg ggcaatagtg agaaaaaata 22680 
aaattaaata ataataataa taaaataggc tgggcacagc ggctcatgcc tgtaatccca 22740 
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acacmg^ aggctgagg. gggagga^g cugatccca ggagn-ag gccagcc«g 2«00 
gcagcaaagc aagacaccca totcaacg^ aa«te«aa aaateagcc ggcaggc«g 22860 
gcatggtggc,cacgcctg.a..cccagcac«gggaggecgaggoaggcaga.cac« 22920 

gagg^cagga g«cgagacc agcctggcca acgtggcaaa accctgtctc — 22980 
acaaaaa^ gCgggeatg gtggcagatg c«gtag»x cagctaCga ggcacaagaa 23040 
.cgcagaac ^gggtggca gaagtucag ^agccgaga tcgtgccacc gcactcc. 23100 
cg^gcgtsag^agactccgtctcaaaaaaaaaaaaaaaaaaaaaacaaggagccagg 23160 

ca^ggtgggg .gagggaggg caoagaagca gcgcc«c tgggggcacc cccaatcct 23220 
agcgatccag aggcccagg atcctgaagg gagaaaaaac gtgaagCcc gtgctagaag 23280 
agaccatagagaaggaa^agcgg^catmacaaaaaa^gaaacgaggccc. 23340 
agaaggtsagcgccctcaatgccccacagggaggcagggagagggc^tgagcccgca 23400 
g°ggccctgga«cttg=aa.gggg«gag.giagcc««ccgcccccaccaggcacc. 23460 

ctcaggagag gagccgagt — g aaggggtcct tgagcccctc aaaag^ 23520 
aaaocacm cc.cc«gag tgaacc»ca oc^g«a accacaagaa aaaoacan ' 23580 

aaggcccagc gcagtggOC atg.c,g.aa ^gcact «ggga8S<=t gaggtggPS ^«0 
gat.go«gagcccaggag..caagaccagcctgggcaaca«gtgaaaccctg.c«a 23700 

,3aaaaacaacaaaatcagctgggogtggtgg«cacaco«agg.cocaacUcttgcg 23760 

ggc^aggtg agaggamgc ttcagcocag gaggUgagg ctgcagtaag cggtgactga 23820 
^ctgcactccagcccagcaacagagcaagactcaaaaaaaaaaaaaaaagcaggcc 23880 

g^gtggtggc^gcagtaat^g-ccttgggaggccgagcgggaggatcagg 23940 

aga«gagacca.c«ggctaacacgg.gaaaccccg««a«aaaaa«caaaaaa. 24000 

.agccgggcg «g«gcggg tgcctg-ag. tccagCact caggaggCg aggcaggaga 24060. 
aaggcgtgaccogggaggtggagc^gcagtgagcgaga^^acaccgctgcacccag 24120 
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gcttctcatc ccagcacagg tagggggtgc tatgggaaag ggatcctcag ttggccctgt 24360 
cactgctcta tcagctgggg acgtggcatc ctagtgaaaa catcatggcc gggcgcggtg 24420 
gctcacgcct ggaatcccag cactttggga ggctgaggag ggtggatcac ttgaggtcag 24480 
aagttcgaga ccagcctggt caacatggtg aaacccatct ctactaaaaa tacaaaaatt 24540 
cgccaggtgt ggtggcgggt acctgtaatc cgagctactc gggaggctga ggcaggagaa 24600 
tcgcttgaac ctgggaggtg gagcttgcag tgagccgaga tcttgccact gcactccagc 24660 
ctgggcaaca gagtgagacg ctgtctcaaa atctcaaaca aacaaacaaa caaaaaacaa 24720 
acaaacaaag cgtcatttat ccagcacccc tggggaacca tgctacctgg tgttttatgg 24780 
tacctggcaa ggtgcaggtg aagttgctgc tcttgggcat tgaacccgtc ttgtttgggg 24840 
cagctcaggc cccaggcagg gtccgggttg gctctcgttg gtgtggccct ggcccatcca 24900 
gacctatatt tctgccgtcc tgcaggtgat caatgttgat gggacgaaga ggcggaccct 24960 
cctggaggac aagctcccgc acattttcgg gttcacgctg ctgggggact tcatctactg 25020 
gactgactgg cagcgccgca gcatcgagcg ggtgcacaag gtcaaggcca gccgggacgt 25080 
catcattgaccagctgcccgacctgatggggctcaaagctgtgaatgtggccaaggtcgt 25140 
cggtgagtcc ggggggtccc aagccatggc tcagccatgc agacttgcat gaggaggaag 25200 
tgacgggtcc atgcctgggc ataagtgttg agctcaggtg ccccgacctg gggaagggca 25260 
ggacaggaaa ggtgacagta tctggccaag gacagatggg aagggaccaa gggagctgat 25320 
tagggagtgg ttatggacta ggaatgtcgg taacaatggt tagaaagtga ctaacatttg 25380 
ttgagcacct gctgtgtgcc cggccctggc cgggagcctt cgtgcccaca gtgaccccgt 25440 

ctgcaaatgtagttccttgccctactcgcactggggagcaggacgcagagccgtgcaact 25500 
cacaggtgcc aagctcagga ctccctcctg ggtctgcctg ggctgggctg tgcttgttgc 25560 
ccctgtggcc cacgcatgtg caccttccac ctgaaagcca ggatcttcag gacgctcccc 25620 
gaggaggtcgttgtctggcacaatgatttgtctcttcctg^aaaaggtgacagagttacac 25680 



wo 01/92891 PCT/USOl/16946 

89 

1 

cctgggcgac agagcaagac tccatctcaa aaaaaaaaaa attaaatctc aaaaaaaatt 24180 
acattaaggc aaactaaaag atgtttaaaa tatatatatt aaaxtaastz cactccaata 24240 
gagcaaatac gaaaataccc agaaaacaca atccccgcac ccccaggaca acctcccagg 24300 
gggtccacag caagagaccc caagcacgag agacagagaa cagtgtccct gtggcggaac 24360 
ctctggccca tcaggctcta ttagaaaata aggctcttgc cactgagaga aagaggcaca 24420 
gtcgcccagc agccacgggc tctggcacac cacgagtcag gccagcaaag tgtcaactgc 24480 
cccctacaag gtgacaaact aggacaaact ggaaaccaga ggctggacct ggagcacagg 24540 
gaccaccaca tggggctggg gaatgggcag ggacctcaga gcgccaccca catgcctaag 24600 
agcagcgcgt atgcgcatgc ctctgcatgg cttagggaca cagggagctc cccccacccc 24660 
caacccagga aggcagcccc cactacccag gtagggaacg gataggacca gcaccccgtt 24720 
ctgctcgtaa ctcagggctc caggccccct cgggggcaac cagcacagag ctcagacccc 24780 
aaatatcttc acccacctcc tggtccccat ctggacaagg gtgctgggga ctggctctca 24840 
gtcacaccct cggggtactc ttcaaaggac agctggatgc cccagggcag gagctrttgg 24900 
cccccagctc cctcacccca gacaccagct cttgggaccc caccagcatg ggcaaggtgg 24960 
acaccatcgt cccgattttg cagatgagga aactgaggct gagggctggc acacggctct 25020 
ccagagctga agagaatgca gagagcagcc ggagccagcc ggtgggtccc tgaggccggc 25080 
tcgtagcaag ccacagctgc ctccgcccat cacacttgga cctcactggc cccaggacag 25140 
ccctccaggg cggcctggca cagagcccac accctgctgc ttcctgaaca aataagtgaa 25200 
caaggccacc aagccgagga cctggatgta gccccggctc ccgccagggc ctccccaaca 25260 
gactccccat ttggagagcg cattaagtgt ttccaaagcc tcacaaacca cagatgtccg 25320 
gctgtctcac ggcttctgta acctgaactt ggccctcact ctgccctccc agcactcctc 25380 
tcagggccca ggcccctcct ctgagatgcc agcactgact ccccaacttg tccccatcac 25440 
ctggctcgtt cctgaacctc ggcaggagag tctcaggcca gatcctccca ccagccacct 25500 
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ccaccaggat gcaggaggca tgagacctgc tcgtgccggc tgggagatgc aaccaaccaa ' 25560 
gatcaatcca atcagcggat gaactgacaa atataatgtg gtccctccac acaatggaat 25620 
attattcagc cacaaaaagg gctgaaatag gccgggcgtg atggctcaca cctgtaatcc 25680 
cagcactttg ggaggccgag gccggcagct cacttgaggt caggagttca agaccagcct 25740 
ggccaacatg gtgaaatccc gtctctacta aaaatacaaa aattagctgg gcgtggtggc 25800 
gggcacctgt aatgcaagct acttgggagc ctgaggcagg agaatcactt aaacccagga 25860 
ggcagaagtt gcagtgagcc aagatcgcac caccgcactc caacctgggc aacagagcaa 25920 
gactccattt caaaaaaaaa ataaaaggct gaaacaccca tacgtggtac tacttggatg 25980 
actcctgaaa acgttacagt aaccaaggaa gtcagccacg aagacgcatt gtaagattcc 26040 
cttcatgcaa aatgcccaga acaggcagaa ccacagaggc agaaagtcga ctggtgttca 26100 
ccaggggatc cggggagagg gaacgggaag tcaccgtgta atgggtatgg gttttatttt 26160 
ggggtgatgg aaatctctta taacttgata gaagagaggg ttgtaaacac tgtgaatgta 26220 
ccaaatgcct gccttctata ctttaatatt ttatattata taagtttcac ctcaatttaa 26280 
aaaaaaaaca actcgacacc tttcacctag gaaagatctg gctttagctt gcatttcctg 26340 
taactcctgc ctaaagcctt ccagaagctt ccgctgcctt gtggatcaca accagactcc 26400 
acaccatgat ctggcctcta agggcctctc gcaggacacc ccgagggtga aggagcaccc 26460 
gtgggcccac ctctgcatag ctgcaaagct tctttccctg tcctcccctc tacatgggaa 26520 
gctctgcccg caggggcggg gccttatctg ccattctatc gcactcaacc ctagcacttc 26580 
actcggtagc agacaccaaa gcaaaacagc aacagcatta taccgggcca ggtgcacgtt 26640 
aactcactga attcatggta ggaaggattc tattcccatt ttacaggtga gaaaactgag 26700 
gcacacaaag gtagcatcag cttcctaagc ctcccagcac aggaagcggc caggctggaa 26760 
tcagaccctg ggcgcagggg ctctgtccac agtgctaact aactactcct gcccccgagg 26820 
gctgcagcgg tgagtgagtg agtttgtcag tggactggat gjf caaggtc atacaggaaa 26880 
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aatccagact attgtaataa cagcctctag accggctggg gccagaaaga tcgaggacgc 26940 
tgacacacaa ctgcgctcac tgcagctctg ccagggatgg ggctaaaggt ctcacacagg 27000 
gcagttaggg ctccccatag cctgggagag gaacggggtg agataacaga aactaggtat 27060 
ggtgcccgaa gtcaaacagc cactgagcat gtaaacccag gtgggtctga ccccaaaccc 27120 
ctccaccccc atcagccctg caacccgtcg ctgcaaggga gaaagcaact cagaggcctc 27180 
acctgcctac atcccccacc cgtgtgtgtg agttctacta aatgcctgag cagtgacaca 27240 
gcacggctga aattaaacgg gttccaaaaa cgacaggaag cacgaagtga atctccccag 27300 
gaaagtgctg aacaaatgct ggatcgggtt caccggcgaa tttcttggaa ctgaagaggg 27360 
gagctaaaca cacggggccc tgctttggag gggactctct cagggtgctc cacacagcac 27420 
ttggttaacc ccactcagcc cttctgggct ctcccagagg gcxcggcctt ggccttgggc 27480 
atctacagga ggaacctcca gggggagagg gggtgcctgg acaggccggc cctggaacaa 27540 
gcacttgggc cccgaggaga gaggactagg gcttgggagc tggggaagtt ctcagcactg 27600 
ggaccactag aacaaagcca tttccgtgcg ttcacagctt ccaattgcaa caggaagcaa 27660 
tcaggaaaaa taattagcgg cccacttact ggcttcgctg aggtccgagg catgtatttc 27720 
acacagtaaa accagggata taacatcaaa accgttctgc agaaagattc ctccctttcc 27780 
ttccatttta ggcctggatc accacattca ctggggctcc caggccttgc tgcctaatgt 27840 
taaaataatc aactctattt ttgcctcaca cacaactgaa ctctacagct ataattcttt 27900 
ctcctcaggg gctcgaacca catggacgac aggcatttga ctccagcaac atcaccccaa 27960 
aacgtgcaca aaacccaaaa ctgcaatgag gtgaaaggca acgcggtcgg cctagaaacc 28020 
ccccctttaa aacaaacagt ttccccaaaa ccccttttgc ctccttgacc caggcatttc 28080 
cggaaaaagg agcggcgctg gcctgtactc cccagatact gtcgctgttt tgtcttcacc 28140 
ttgttttgct agctccagac aaggccccac aatgtaaaca cgctcctgaa agaggcagat 28200 
ttggggtgaa actgtccata gaatctctag gcttgggtca gaggcaggag gacgtgaaac 28260 
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aaactccaag ctcctcctgt tccccgctgt cccccacacc tccaagcaga ggctgcagcc 28320 
tgggggatct gactacaggg ccaccccgct gcaccattca cactggaaat attcagggag 28380 
acagctgttt gccttaagga ggcccagaca aaggggcccg aggtcctcxc cgctaaactg 28440 
ccacaaacag aacaggagcc gcggcgtgca caggcacttg cggccgtgcc acttggccag 28500 
ccatactcca gaaaaacaaa acacgcacat ccgaagagaa tgatttaggt agcaagaggc . 28560 
ttgcttgaaa aaccacatgg caatctccaa attaaaagaa catgtgtagc gtttcacgac 28620 
tgcttaagtt tcctgagtcc tcctgacctc aactccaccc cctgggaaac accaaaagtt 28680 
ggatgagaaa gttcccccgc cctacctctc cccacgggag tgtacaactg aggcacaagc 28740 
ctgcctcccc cactgccccg cgatctggga ccacgtctcc tccgcgtagc cgacccgggg 28800 
atggacacta tctggggacc cggcggccac acggggcatt cgggtcgccc gggcacctgg 28860 
caggtgtcag tccgcttgga aacccacagc cacgcggctc acaggagcag cgccaccggc 28920 
taggccgccc cgcgcccggg ctcagaactt tctcgctgcc acttcagccc gtcctcggag 28980 
cacgcggggc ggccgcgcgg ccgctggaaa caggcttgcg aaccggctcc ccgggccagg 29040 
cccgcctccg cgccccaagt ccccgctcgg tgcccggccc gggccacacg ggcccagcgc 29100 
gggctcggct cggctcccgg cttcccgcgg gctcgggcag gtgaggaccc gcccgcgccg 29160 
cacctggcgg agcgggcgcc ctcctcgcca gcccgggacg cagcgtcccc ggggagggcc 29220 
cgggtgggga gacaaagggc ccgcgcgtgg cggggacgcc ggggacggca gggggatccc 29280 
gggcgcgcgc cccaactcgc tcccaactcg ccaagtcgct tccgagacgg cggcggcgcx 29340 
cgcgcacttg gccgcggggc cgcccgggcc attgtccgag caacccgcgg cccgtcttac 29400 
acgccgggcg cgggaaggta tcgaatcagg 29430 



<210> 8 
<211> 33769 
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<212> DNA 
<213> Homo sapiens 

<220> 

<221> unsure 

< 222 > (33739),(33749),(33758) 

<223 > Identity of nucleotide sequences at the above locations are unknown. 



<400 > 8 



cttcccctta cactggtcct tcgacccgcc tcggatgaaa actgaatggg tttagcctta 60 
gaggctctcg gtctctaagg gaggtgggtc aggatgccgg ggacagggtc ctcttcctgg 120 
ggcaacgtgg gggaacgagc cacctacccc tccactgaat tgccctgggg tgtgggtacc 180 
gacggctcat tcggtgtcca gggtctgaga tgtgttgaca ggaagaatga aaggggatgg 240 
gagggatggg gcgaaagaag ccacctgcag ccccaggaac tatctggcca gcacaccgtc 300 
acccagcggc ctgagccacc cctgccagag ccaggaggag accctgccaa tgggtcacca 360 
gtgtgcagga actcagaagg tcatcacagt taataccctc catgccccaa tgtgggaaaa 420 
caggtttttt cacaacaaac aagataattt ttgttatttt ggcaaaagga ggcagggcag 480 
ccccggacac ctccatccca cctcatcacc cagccgcagg gccccggcca tccctgcaga 540 
cagagtggat gtcacaacct ccctgcaccg aaccaagtgc agctcccagg ccacaggcca 6^^ 
cccaggaaag gtccagtggc ccccggaggc tcccaccgca ggcctcccac cacagccggc 
accaacccag gatagctgtg ttctcctggc ttcttttcac acgggtagca gaaagctgag 720 
atccggggaa agctgagatc cagggaaagc tgagaatcgg cctctgctgc ccggacgccc 780 
acccccagct ctgctcccag ctccagggcc tccttctcag gtgcccttac aggaggcaga 840 



600 
660 
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gggcttgagc cacctcctgg gcctggggca cgcaggatga acggggtcac ggtgcaggcc 900 
actgtccact gcgcagatcc caaggccata aacagcctgg ccacagtggc ttcccagctg 960 
gcaggcggcc agattatttt tgttgtttag caattgatta agtttctccg ctgcccccag 1020 
gggtaagtgg tggggcaaat gccgcaaccg cagcatttga cccgggatcc tgtgccaagt 1080 
gaccataggg tcacaaagca caagggaagt ggctgggccc gatgctggct ctgctggaac 1 140 
ctgaggccgg ccactgtcac ctgcacggtg cctgggacct tccagcaagc acagagaagc 1200 
tatggccctc caggagcagc tggcaggcac cttggcctgc agtcaggggc totgtctgct 1260 
cagctetaaa acaggaaagt cgctgctctg cctggggtca gggcagccag agagtgacca 1320 
agtcagtgcc ggcctcagga agggacctgc aggcgggtcc cttcctctcc catccctcgg 1380 
tgccagccag cccctcctgt ggccccccac tgcctgcctc tgcccccatg ccccaccaca 1440 
acctcaggcc catggctgca tggccactcc ccaggcaggc agtggggatg ggattteacc 1500 
atgttggccaggctggtctcgaactcctgacctcaggtgaggagttcctaaagtgctggg 1560 
attacaggcg tgagccaccg cgccagccct ccctgtggta ctaaacactc acaccccctt 1620 
gctggggacc ctggtgaggg aacacagcct cacaagtgaa gtgtggtttt gttgagcaaa 1680 
tgacgcctgg gcagccctct catctttgcc taaaactgaa gaatttaggg gcgtggatgt 1740 
ataaaacagt tggtgactta aatgaaaaag aaggccacac tccccccttt aggcaggcgg 1800 
cctaattctt taaaagccag cacagggtgc ctttctgaac ccaggcacac agtaggtgtt 1860 
caatggacag cagcggttac ttgtactgct catgacaccc tgtctgtggc ctctgcagct 1920 
ggctccagcc tgacgcatgg ctgcgcccct ccgcaaggcc accccggtat acatggaaac 1980 
tctgtggaga aggccttggg ggccggccag gacgccaggc ccagatccca tctgcgccct 2040 
tcctccatag acctcagcga gctctcggca ccatgtgcct caggcxcatt taagaagtag 2100 
ggccggccag gcatggtggc tcatgcctgt aatcccagca ctttgggagg cccaaggtgg 2160 
gtggatcacg agatggtcag gagatcgaga ccatcctggc taacacggtg aaaccccatc 2220 
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tctactaaaa atacaaaaaa taagccgagt gtggtggcgg gtgcctatag tccaagctac 2280 
tcgggaggct gaggcaggat aatcgcttga gctcagcagg cagaggttgc agttagcgga 2340 
gatcgcgcca ttgcactcca gcctaggtga cagagagaga ctctgtctca attaaaaaaa 2400 
aaaaaaataa aaaaaagaag cagggccagc cacggacgac ccctcacaca gctcccagga 2460 
cgcgtgcctgggtatagggctcaggaccatgaccgctgcagtggcccccaagaaacgtta 2520 

cttttgtcac ccaccccgcc tcagtggcag tagccaaaat aacggattag aatggaacca 2580 
tgtgacaatg ccactgcccc aactgacaga agatggctat cagcagttca cgcggcccca 2640 
cctatcacaa gtgcagggca ctctacaact tatgcatcct tccccagaca ccgtcctttc 2700 
gaccctccca ggtcagcaag gcacacaggg cctacatttc acagccacac agcagagggc 2760 
tgaggctgga actcggatgc tctgatttcc gttcaatcac atccccagag gtggcacaga 2820 
gacggggggc ttctcttgac aaagtcaaga aagtcactgc cagctccact gaagaccaaa 2880 

gaacctcagc tctcaaaccc tcttgaaggt gttaccgaac tctcccagcc tgtttcctgg 2940 
gtcccgatgt tggtcccgtg ggacacagga agaggaagaa gctccctaga gcagagcctg 3000 
gtgcacctgc cacactctca gagggctgcg cacgggcgga ggagccgtgt gcaggagtgg 3060 
ggtctggatg gaggggcgct gtggccgggg gcagggggca ggggaagggt gctccaggtg 3 120 
gtgggcacag cacgagcagg ggcagggagg tccacactca gatgtgcaca gggagaaaca 3 180 
aatcgtgcat ttccattgga ataggcggta aaaggtagaa aaacagagtg ggggccagga 3240 
agggagtcgg agccttctag tgtctctctg caggtgagcg gcagcccgag gtgtcagctc 3300 
agcagacttg gggtccaggg gccgtgtctt ctatcactga ccccagggca cacggaactg 3360 
gggagggaga gcagaggcac agggcacggt cagtgaaacg aaacaaggag tcatcaccaa 3420 
atgcggaaag ggcaaggagt gcccgcagcc gcacaagggt tctgtctggg caacgtgggc 3480 
gtcccaccag gccccgcacc ctgcaagcgc aaagctcgcc actgaagata aagggaagct 3540 
gttggagctg cggagctggt ctggggtccg catggagctg ggcttatgct gcagtcacaa 3600 
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gggggacatg gaagaggctg caggggacaa aaccagtgac cacagtctaa ctctgagcct 3660 
gtggaaaggc gcccacagca ttcacccatc ccagagatgc cattccccct gtgcccccgc 3720 
tccacggtga cagcgttctc caggaatatg atgcgcccct ctcctcttgc atcagccctg 3780 
acagtgagta ttcaggccaa aaagcagaag agcacagctg cgtggttcca tttccatgta 3840 
gttctggaac aggcaacgct aatccaaggt gatagaagtc aggagagtgg tggagggggc 3900 
gggggttgag gatggcaaag gggcaccggg aactttccca gtggtagaaa tgttctctgt 3960 
ctggaccgtg tggtagttat gcagacatat gcagctgtca aagttaatcc aaatgtacac 4020 
gttaaaatgt gtgcgtttta ttgcctgcaa gttatacctc aattaaaaaa ataaagttag 4080 
cactcaggct tcttccacaa cttcctgaac cgtgtgagct gattttcttg ctattaaaaa 4140 
ttcacggtcc atggctgaga acagcagctg ccttctgttt gcaaagtcaa cgccaatcac 4200 
tgcccggccg cggcagactc ggccccacag gacctccttt cttttttccc tttgacctac 4260 
ttccctgata agtgacaaga cagccagact ctgggaacaa acgcccgtta ttcggccccg 4320 
agctgagcgg gccctgcttc ctgagctaat ccgcccggac agacggaggg acgtgagggg 4380 
ctttgccgtc ggctccagct gtcagtctgc ccgtcagact cgacagtggc cccctctgtt 4440 
cctcccgctg cccccactcc atccccgact tctttttgtt tcctgtccct gacagacgaa 4500 
catctgttaa aactctgtct gggtgagctg tggccagcgg cccacaaatc cccaagccgc 4560 
accccagcct catctgggcg ctgccgggag cactgcctgg ccaccctctg gacatagctc 4620 
tgagagccac cggccagggc acgtgtggcc cgagtggcat ggtgcacgcc gctaagccca 4680 
ctgcccaaag gcccccaagc aggagggatg tgcaggagac aaaagtcaaa agaacagggg 4740 
cacgttccac agaggatggg gctggagggg tggcagtgag gaacagcagc ttccgaggat 4800 
ggcggtggca actcccaaat aaggcctcac tcctgctgtt tttagctcat tccacataat 4860 
tggaaaaaca tggcagaaac cgaagccagc tgctgccttg gtcctggggc tgtgtggagg 4920 
gggtggggag gccggaggcc caggctctgc actcgac^c tggggatgag agtgactctg 4980 
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ag«.«.ag agcagcatog .gccgcca. agocccggcc aogctgggeg 5040 

gcagaggcc gtgggauu cctgccetgt o^atggggg Kacttcagg aggggcgggg 5100 
.agccaggac acagcccagg gCagcggto acccgcagc .caggggcca cgtaaatagt 5160 
gccacc«gaaggcacacageagtgcggggcccccccogccacoaacgca.cc.acc. 5220 

-.agg^gccg«.gtgtgce^aa.gc.gc.cc.«.™««8^^^ _.. 

accaccc«cagcccc»c«g«gaaggC3CC.gac.ccc.acaoccagctggo«. 5340 

ea^gccaaaatcaggaaa^cagaattcaagacatcacagaaawc^cgcagt 5400 
aacuca^ga aagataaacg g^acaoc caggagggag .cocagggac ccttgag.. 5460 
cacc^aggc «ggcttca aa«tcgaga tgtttccag. catgcagcg ccgcccccca 5520 
caac«gccccacacag.cc.c^gaa^Sat«S.occccacotgccccgt 55S0 

« ggagegggtg cgaggg«S ^^^^^'^ 
.gag.tgag.ga«ctgacacagccaggcc«gcccccotcc.8>e«»gccccaca 5700 

^agccacacgcctgaagcgcccagcacaccccc«ccg.c«ccccagg«ac 5760 

ccg«ggccgtg«agccgtg~.gcccct.cacccaccccagc.oc.ctggc 5820 

3g=acccagcc«ggaagctac«tga<taoaaccgccgaaggaagactcgc,c«og 5880 
gcac^acc agacagcctg cacca^og cgctcagca caa^ccacac agccuc* 5940 

c.aaccccatggagcggggag«taa,caccccc««aocaacggacaaactgaagca 6000 
eag^aggaaag^cmccaag^cccaacacgatgacaaaaaaugaaggtcagc 6060 
ccgcaagtggaa.,agg.go..caag..ccgg«go«gaeactgcac«cc.cgcog 6120 
ecacggtccoggg.ccgcc.gacac.gcacc.c«cgccgcoa.gg.cccggg.ccgcc. 6180 

^cacgc^ ccccgccg ccacggtccc ggg^gcc gacactgcac ctcccgccg ■ 6240 
ccacggtcccggg^ogccgacac^cacc^ccgccgccacggtcccgggtccgcct 6300 

^cac.gcacc.cctcgccgc^gg.cccggg^g=4i-«Bcaco.c«cgccg 6360 
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ccacggtccc gggtctgcct gacactgcac ctcctcaaca ccaccacggt cccgggtctg 6420 
cctgacactg cacctcctca ccaccaccac agtcccgggt ctgcctgaca ctgcatttcc 6480 
tcatcaccac agtcccgggt ctgcctgaca ctgcatttcc tcatcaccac ggtcccgggt 6540 
ctgcctgaca ctgcacctcc tcaccgccac ggtcccgggt ctgcctgaca ctgcactttc 6600 
tcaacaccac tccttggccg gctcccaact acaaaccaag ccatgtcttc catcctgaat 6660 
cctcttggcc taaacatcac tcacaatgcc tccctcggga acaggcacaa gtcccaccag 6720 
cacagcctcc ttcgttacct gcgtttccgc tagcccaggg ccagctccag agccctcacc 6780 
acagagcctc tatccttcac ccccggacac tggacctcac caacccatag cctggaggag 6840 
atccctgtgt gaccccaggg cctcctctgc ccgactctga atttcactgc ccaacgtgac 6900 
acctcggaag gctctctggg cactggcagc cctccatggg caccgctcct tctggccagc 6960 
tctgacatcc cggctggtga ggtgccctgc acgaggcctc tgcccactgg gacctcacag 7020 
ccgtgctgtc agctgcaaca agcgacagaa tttcacgttt tcttcacgtt gcccctgggt 7080 
gagcagctcc aggtagtttt cagtcgaggc gaggcgtccc gtcagcagcc aggcggcaca 7140 
gctaattcat gcccgccggg cgcacggccg caataccaat gggcacctgc agcctggaaa 7200 
gccacagagg aaccgagaac agcgactgtg ctcaggtgac aggactgtgg tcttttaaca 7260 

aaacattttcctttaacgtgatattttacggcaaggaatgaaacctggagggcaggacat 7320 
ttggatacta aagccccagg ctgccgcgtg gtctgctttg tgaagtctga agcccgcgcc 7380 

ccattctggccccgctcacaggtccggctctgactcaccagcttcaatgctaggccgtgc 7440 
ctgtcctcca accagaacat gacttcctta aggacaaagc cgtttctcgc ccatccccat 7500 

ctccctctggattaagaaatatgggaagatcttctagaaccacctcaaatttgcagagag 7560 
ccatcctggt gacaaaccct tgaaatgctt ctaagaagag tttaggtttc ttctcaactc 7620 
taaaacctct agaaaactct atttccacac cagctgcccc tggaacactt cagcttcaaa 7680 
agggcccagg gcagggagac ggaggagcca gcatccacac cgagcaccag cctgttaatt 7740 
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aacgggaagc gggtggggcc catctccagg cagctctgag gtcagactgg ggaaccatgc 7800 
ttacaaaaaa aagtgaactg aaacgctcac gtcctcatgc aaaaccagac tcccagttgc 7860 
atctttctgt ctcattgagg agctttttcc tccctttgac agaacaccct acacacggca 7920 
tctggaacca aagcagaaag attcaggctc agagtaaaac agtccccaca ctggctgcat 7980 
gtggacgttc ccggcccaga gtctcgccca agcagggcct ataaatgaca caaaatgttt 8040 
ttctoctgcg tgccagtcat gctccaactg agttatgtgt aaaagtgcct ctcacggctg 8 100 
agggcaaaaa cagttcccac aagactagag aaaggtgacc cctgacggct gagtctctag 8160 
ggagcgtgga gctgcgtgct cagcxctgcg gccctgacgg ctctggaatg gaaaagctat 8220 
ccaactggaa gggcagggct cgctgctagt ccagcggtcc aaccccacag gtgtctgtgg 8280 
tgtcagctcc atgccacaga gcccagggct ggggccagag ccaccaggcc ccctgccagc 8340 
ctgcaggggc ctcctcctct gggtagccta accaccccct gtgagcgcag gcagcctcct 8400 
ctaatcacca cagggcctgt ccccccctct cccccgcttg caggaaaatg agccctgagg 8460 
actccccagg gctgctctgg gcctggacat ggagactggg aattacattt gcagaaggag 8520 
cgcaatgccc ttgaagggct cagccacgag cagccagtcc ccagggctca gaaggcccag 8580 
ctgttagaac cctgggagcc agcaaagagc caggggctcc acxtaagtct atagcccctg 8640 
cctcttctgg ttgggaaaga aatcaacgcc cctttactgg ctcccactga cagcccactc 8700 
ccccaggtat gggaggattc tgggacgatg caggcaaacc tggaccctga gtgaacctgc 8760 
cccagctctc acgggcctgg caccagccac agcacctaag gcgccggtca tggtgacaac 8820 
atgaaggtga taagggcatg gacagtggac atggcagctg gacactgggc acccactgga 8880 
tgccaggcac ccagcacggc tccgtcaccc ctggatgagc agtggccctt tgcaagccag 8940 
ggtagcctgg gcaagttatt tgggggtctc caagcttgtc cagctgtgcg acttcactga 9000 

gccatgagto tgggatttta tcagggccca cacccgttcc tggaactctg atacgtgagg 9060 
gagccacaca gggaccctta acaaaagctc ccagggcaac atgttctctt gcctcagtct 9120 
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cccaaatagc tgggattaca ggcgcacgac taccgcccgg ctaatttttg tatttttagt 9180 
agagacaggg tttcaccatg ttggccaggc tggtcttgaa cccctgacct caaatgatcc 9240 
ttccactgtt agggcaaggc acctgacagg cacgactgca cgatctgctt gttgggggct 9300 
gtgtccattc cccactcctt cgacaaatgt ccacacccag ccttgctttg acaccccaag 9360 
aacagagatg gtgacacctg cttcctacat gcccattgct ctcccaaggc agacatcccc 9420 
agcagatgca acacagtgtt taggcagaca tcaccaatcg atggtggcaa cagacaccag 9480 
gccctgctcc ctctaactcc agtggccagg ccccaagcca gctctcacct gcccactccc 9540 
aacccacagc agcaagactc agaaatggca aaaacacaaa gagaacagaa acgccccata 9600 
gcgggaggat gactaaaaga catgtcttga taagatattg ttcaggcata ggccaggcac 9660 
agtggctcat gcctgtgatc ctagaacttt aggaggctga ggtaggtgga tcacctgagg 9720 
ttaggagttc aagaccagcc tagccaacat ggtgaaaccc catctctact aaacatacaa 9780 
aaattagcca gacatagtag cgggcgcctg taatcccagc tgcttgggag gctgaggcag 9840 
gagaattgct tgaacctggg aggtggaagc tgctgtgagc cactgtactc caacctggac 9900 
aacagagcaa gactctgtct caaaaaaaaa aaaaaaaaaa gatatccttc actaaaactc 9960 
atgtctttgatacatatttacctcctgcaatcgcaaatgcttctgcagtgcataaagtga 10020 
aataaatagc aggaagcctt acggttcgat cacccacaca gacacacagt cacatacagg 10080 
aaaaacgcag ggagggctgg ggaacaaaaa aacagaagat aaaatgtgga gacagacaca 10140 
ccaagagagt aagagaccac ctccagacct cccttcagct tctcaaacac acgagccggg 10200 
cccgttacag aatttgcggg gaccgctgca aaatggaagt gcagacagcc ccttactcaa 10260 
aaggtaggaa tttcaggtca acaacagagc tcacctcata tgactacaca ggtcacacag 10320 
cccgtgaagt cggtcccaac accagcatgc tcctgcctca aagccgctgc acgtgctgtt 10380 
ccttctcgcctttccctcttttagtccttcagatctcaggcctcctgagagaga^^^^ 10440 
acctgccggc tcaggcggcc acacccccag tacaggagtc tccggctcag cccctgctgt 10500 
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gttccgtacc cgatccaggt ctgtcctatg tccatctgtg tgccggcttg cttcctgaca 10560 
tggcccccac cacacgtgtg cctcggggca ggggaacagg cccgtctcat taactgcttt 10620 
cttctcagat attttctgga atatttgtgg atattgggca acatatatgc tccacctttt 10680 
tcagactagc caggacgagc tgcatttttt tttttttttt tttgagacag ggtctcactc 10740 
tgttgcccag gctggagtat agcggcatga tcttggctca gtgcaacctc cgcctcctag 10800 
gctcaagcaa ttctcctgcc tcagtctccc aagtagctgg gattacaggc ccgtgccact 10860 
actgcccagctaatttttatatttttagtagagatggagtttcaccatgttggccaggct 10920 
ggtcttgaac tcetgaccto aaatgatcca cctgccttgg actcccaaat tgttgggatt 10980 
acaggcgtga gccactgcgc ccggcccgag ctgcctgttt tacacctttg ccatattccg 11040 
gtgattctct ctcccctccg tcccccggcc ctgactgtgg tggccactcc ctgccgtcat 11 100 
gagcccgtat gtcctcactc tttccctttc cgccaggact tcaaccaaca ctgcagagcg 1 1 160 
cagggtccag ctccagcact gagttcagcc tcttctcacc aacagacagg caggaaagaa 11220 
aacaaactct gagaaggcca aggttcccgg gcagccagca agccaagcal ccttctccgc 11280 
tgaggcttgt gcagccgagg caccccctcc tccagggagc aggcagcgtc ctggggcagt 1 1340 
ctgcgaggga gaccagggcc cttgctccac cagggcccca ggtatggggg cagcagcaaa 11400 
ctcatggctc tgggagccag accccacctg ctagaaccta ctatgccacc tgctgtgggc 1 1460 
aaccccaggc tggtgacttg ccctggcctc ctctgtaaac aaagggctca tccaacctgg 11520 
tcaaaccact cctccccttc aagggtctat aatcctccct taacctgctt ggtccaaacc 11580 
cctggtgtcg ccaggtcact caggaggcag ctcatctgga ctccttccct gggtccagtt 11640 
tctctctcaa cattgccttt gaggccgagg tgaacggtca acagcgaagg gccccagagg 1 1700 
tgatggagga gcgggtgtcc aagacactca ccctttctaa tgcactgact ccctcgtgga 11760 
ctcacttgtg ccgtctcccc cacccaccca gccccagagc ccagagtgcg agcgccagag 1 1820 
gcccgggatt ctgtctgcac cgcggggtec ccagtgcctc ggagcaatgc cagcacccgg 11880 
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caagtgttcg acaaatgcct gctgaatgag caaatggatg gatgaacgaa tgaatgagca 1 1940 
agcagatgaa tgaatggggt gctgtccaga gccgtgagga ctaggccgcc caagtcccca 12000 
tttctcaaat tctccttctc ccgacttggg aaacaagatg cttggtcggg gaggctctec 12060 
aaccatcccc tgcagcagcc ggcacagcgg acagaccctt tgatgtaaca gccatgtctt 12120 
cattaaagat gccctgctct cagaaagaga aagacaaata caaacctgga aaatcctcac 12180 
caaacgcagg acccctgcca gggagcagag aaaagaccca cacgccacgg gcgccacgac 12240 
cacacacaca ccccagccgc tgcacacaaa cacagaccct agccagcaag aacaggggga 12300 
ccaggaaact gttcctaaag tcaggacccc catgtgctca gacagcagtg agagcaagga 12360 
cacttctcca tccaccggat gccaggagag tccttttagg gggccccaca ccgagactct 12420 
gcccttagga ctgttcctga gtgtggaagc cagcccactt ggaagecccc tgccctcccg 12480 
agtgggacac cggcacagga agcaggccct gtcccccacc actttctgca agctgggccc 12540 
catcacgcta cagaaacggg gaggactggt cccagggatg gcgctttcct gacacctctc 12600 
gttaccccct cgcttgccag gccccagggt cagccccaga ggccagactg gctatcccag 12660 
gcccgggagc atccccgaag gcgagctgca tcctgaacgt gtgtgatttc ccgaagggcc 12720 
cgccccgaac cgacacctgg aaagaaagat cctcagccgg tgccccagag gagaagagcc 12780 
atgcctcact gcaacacagt cccaggaagc accaagtgcc tgaggaccaa ggcggagagt 12840 
aaaaaagtgg aaaatatctg gggcaaaaat aaaacaaaac aaaacaggat tgacctcctg 12900 
ggctcaagca atcctcccaa ctcagcttcc cgagtagctg ggaccacaga cttgaatcac 12960 
cacacccgcc aagtggatca tttcgaacgg gtttgccgag gttccttctg gggcaccccc 13020 
ggcggccgca acccattccc gccaggcccc gccccgcccg cccgccccgt cccgtcccac 13080 
cgcctcacct gccttacacg tcctgccgtt gtcctgcagc tgcacacccg tggggcaggc 13140 

gcatgtgtagaaaggctcgcttggggacagcaggcacaggtgggagcagccgccattgtc 13200 
ctcctcacag cgagtgtgga ctgagaaaac caggacagac tgagagaagg ttccagaaga 13260 



wo 01/92891 PCT/USOl/16946 

103 

ggaccgtcac ttgtttctga atgagtcaca tcctgcctcg tcccccgtga cagcctccag 13320 
tgtgtccctc tgcccaaaca tcggcctcaa gtggcatcag ggacctcccc gcgggcacca 13380 
ttccacctgc ctcatcgctg gccccgtcca catggggccc tcagcctggc cagacggcct 13440 
gcaatttccc caaaaccagc cgtgaccttc ctggccaccc tcacacccag atgtgacctg 13500 
cccatggagt gacatcctcc ccatctgctt cctcccacca agctcctatg actagaacac 13560 
cctccccagc tcctcggagc ccccaaagga cacccctctg caaaggctgc cccccacgct 13620 
ccaatggccg gggtcaggac ctgcctgtgt ggtagtgacg ggaaccccag agacaatggg 13680 
ctcctgggca aaaggcttgt cttgtctttg tgctatgtgt ggacccagca gcttccatag 13740 
gaacactgtccttcttgctgggatggccaagcttgtcactctcccaagccctcctatgac 13800 
caacagcaat tgaacggaac tcgataaatg cttccagcac ctcattcaaa ccaggggaaa 13S60 
gctgggtgta gcagccccaa aatacggata taactggaac aacaaactca tcaaaatgaa 13920 
(xtctccctccctcatgctgccccaagtgtagatgggttttgtgaccacgactttctcac 13980 
caggaaacag ctccagagag ccccaccctc ctgtgtcctg ctctgggaac agctggcacc 14040 
cctaggcccc acatttcaat tcaaagtcca aaccttccat aatggcctgg ccagaaatct 14100 
ccatccctgg tccctgtggg agtgggccac tgtccxcaga gccgcagccc cactgtcaca 14160 
gaagctggtg catttcccca tcagggacct ctgtcacaac ccagcgtggc ccccaggctg 14220 
agaactgctg attctgggca gattattcat tgataaatac gcgacttgca gggccaagca 14280 
tggtggctca tacctgtgac cccagcactt tgggaagtca aggtgtgagg atcactggag 14340 
cccacgagtt tgagacaagc ctgggcaacg tggcaaaatc tctcatctct attaaaaata 14400 
catacacaca cacacacaca cacacacaca cacatatata tgtatatata aataaccata 14460 
tatatatata cacacatacg tgtatgtgta tataaataca tatacacaca cacacagaca 14520 
acttcttctg ggccttgaaa acgaggcaac cttccttgga aatccccttg ccactgctga 14580 
gcctgaaata gcccccatga gctctgcaga ggggtcctct gcaggcccgt gtcccccagc 14640 
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cagccacaca cctccctcca ttgcagcagg taccccttta gagagggggc cccccagagc 14700 
atgggcttct gcagggaggg gtcacctgcc cccccacccc acccacgccc gcgcaccccc 14760 
acgcccccgc atcctcccac tcccctgccc cgcgcccccg ctccccccag ccccctcacc 14820 
ctctcccccg tgccccaacc ggcactcaca aaaaggctgc cgctcctggc tcagcacctg 14880 
gatgtccatg ggtgagtata gggcactcag gatctccttc ctcttccccc cagtgcgctt 14940 
gttgcaggca tggatggagc gggtctgcca gtctgtccag tacagagtgt ccccggagag 15000 
cgtcagggcg aaggggtgcg tcaggctgcc ctccaccacc ttctgcctgc agtcagggaa 15060 
gcggggtgga ggagccatca ggagggtccc ccgacagtca ttgctgctga cccaattaat 15 120 
ttcttttttt ttttttgaga tggagtctcg gtctgtcgcc caggctggag tgcagtgatg 15180 
taatctcagc tcactgcaac ctccgcctcc cgggttcaag caattatcct gcctcagcct 15240 
cccgagtagc tgggatcact gatgcccacc actacgccca gatgattttt gtatttttag 15300 
tagagacagg gtttcatcat gttggcaagg ctggtctcga actcctgacc tcaggtgatc 15360 
cacccacctc agcctctcaa agcgctggga ttacaggcgt gcgccaccat gccaggcttc 15420 
ccatttgctt tcaaccagac aagtgaggcc aggtcaagag ccccaggagc tggcgccctc 15480 
gtacatttct cccggcgtgc acagggcacc tcccaaacac agcctgtgat ggtgacacac 15540 
gggctccccc aggtcaagtg gcaaagtctc ccccagggaa gaaaggagga agccatgcct 15600 
ggcaaaaagc acacctctcc tgcccaacgc tttaacctct gtatacaaat caggccatgt 15660 
gcactcgctc cttcttacaa tgctcataat ttatactttc agagtaaatg aaacttggca 15720 
tcaacccgag aaacagctat tcttttctag atgcttacag tgcccagcaa atgaggactc 15780 
gggtgtaatg agattatgga cactggaaac aggatcataa tgtgacgtgg tcggtaatgt 15840 
gcagttttat ttgcttaatg accctcgccc cgtgacaggc tccctgaggg tgggcctggg 15900 
ggcagaggtc cccgccacgt ccccajgccct cagcacagtt gccaggagag ggtgacactc 15960 
atgaagtggc acagggaaga tgggagctgt gggctotgca gatecaccac ctcttctgtt 16020 
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catttttgtt gatgctgttt tttaagaaaa ttattgaagt aaaattcaca ggacatacgt 16080 
ttactttttt tttttttttt ggagatgggg tctcactctg tcacccaggt tggagtgcag 16140 
tggtgtgatc tcagctcact gcaacctctg cctcccaggt tcaagcgatt ctcccacctc 16200 
cgcctccaga gtagctggga ccacaggcgt gcaccaccac acccagctaa tttttggggg 16260 
gtatcttttt ggtagagaca gggtttcgcc atgttgccca aggctggtct tgaagccctg 16320 
agctcaggcg atccacccgc cttggcctct caaagtgctg ggattacagg cataagccac 16380 
tgcacccagc ctaaatttac cactttaaag tgaatagtgt tacctagtgc attcgcaagg 16440 
cggtgcagcc tccacttctg. tctagttcca aagcacttcc attgccccac aggcaaaccc 16500 
cacacccggc agcagtcatg ccccagtccc cgcccccagc cccggcaaac acttttgatg 16560 
gacttaacta cacacattct caacatctca tataaacgga atcacaatat acagcctctg 16620 
atgtctgtct tctttgactt ggcaccatgt tttcgaggtt catccaggct gtagcatgtc 16680 
agtgcttcat cccgttttag gggtgaacca tattccagtg tgcagacaga aaccaatctg 16740 
tgcatccatt cacccactgg gggacctttg tgtcatttcc accctcggct gttgtgcaca 16800 
gtgctgctac ggacattact gtccattcac attttgtgtg aagacctgtt ttcgattctt 16860 
aagagtatac agctaggagc ggaattgctg ggtcatacgt aaatcaatgt ttacgtctca 16920 
aggaatcaac aaactgtttt ccacaatgtt gtcttttttg tttgttttct gagacagggt 16980 
cttgctctgt cacccaggct ggagtgcggt ggtgtgatca tggctoactg cagcctcaat 17040 
ctcctaagct caatccatcc tcctgcctca gcctcctgag tagctgggaa cacaggtatg 17100 
taccaccatg gccagcteiat tttctaattt tatttttttl tgtttttgtt tttttgagac 17160 
agagtctcgc tctgtcgccc aggctggagt gcagtggtgc catctcagct cactgcaagc 17220 
tctgcctccc gggttcacac cattctcctg cctcagcctc ccgagtggct gggactatag 17280 
tcaccggcca ccacgcctgg ctaatttttt tgtattttta gtagagatgg ggtttcaccg 17340 
tgttacccag gatggtctcg atctcctaac ttcatgatcc acctgccttg gcctcccaaa 17400 
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gttctgggat tacaggcgtg agccaccacg cccgacctta cttttaattt tttaatttta 17460 
ttattttatt ttattttttt tttttttgag acagagtctc gctctgtagc ccaggctgga 17520 
gtgcagtggc gggatctcag ctcactgcaa gctccacctc ccaggttcac gccattctcc 17580 
tgcctcagcc tcccgagtag ctgggactac aggtgcccac cacgatgccc ggctaatttt 17640 
ttgtattttt agtagagaca gggtttcact gtgttagcca ggatgatctc aatctcctga 17700 
cctcgtgatc cgcccgtctc agcctcccaa agtgctggga ttacaggcgt gagccaccgc 17760 
gcccagcctt miLULLl tttttttttt ttttgagata gagtcttgct ctgtcgccca 17820 
ggctggagtg cagtggcggg atctcagctc actgcaagct ccgcctccca ggttcacgcc 17880 
attctcctgc ctcagcctcc cgagtagctg ggactacagg cacccaccac cacacctggc 17940 
taatgttttg tatttttagt agagacgagg tttcaccgtg ttagccagga tggtctcgat 18000 
ctcctgacct cgtaatccgc ccgcctcggc ctcccaaagt gctgggatta cacgcgtaag 18060 
ccatggcgcc cagcccatgt ggccattttt cagtgagaga agccagaggc ccatcactct 18120 
cggttgctcc ctgggccatg ctctgcctca gccagaagca ctgagggaag gtcagcctcg 18180 
gcccttgccc cagccacagt cacagataaa ggggcctgca caggtctgtg tggctccaga 18240 
gctcgtcacc caacacacga cgcttccatg tgaatagccc caggtgcatc atgaagagcg 18300 
atggccgctg cagaggcaga agaatcccgc ggggaagcag gtgggagaga ggctgagaac 18360 
agaccagacc ctggagctac agaccctatg ttccaaccct ggctgggact agctgtgtgg 18420 
ctctgggcaa attcacatgc ttctctgtgc acaggggatc aaaatagcaa acacaggcta 18480 
ggcacagtgg ttcacaccta taatcccagt gctttgagag gccgaggtgg acacatggct 18540 
taagctcagg agtttgagac cagcctgggc aacatggtga aacctcgtct ctacaaaaaa 18600 
aataccaaat aaattagcca ggcgtggtgg tacgtgcctg tggtctcagc tacttggaag 18660 
gctgaggcgg gaggaacact tgagcccaag aagtcaaggc tgtggccgcg tgtggtggct 18720 
cacgcctgta atcccagcac tttgagaggc tcaggtgggt ggatcacttg tgatcaggag 18780 
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t«ag^ccagcc«gccaacatgstgaaaccccg.^<a«aaaaaaa.ac.^ 18840 
.gccaggcgt gg.ggogggc aocgtaatc ccag«ac„ gggagg-ga g.caggagaa 18900 
ugttagaac ttgggaggtg g^ggttgtag W-ga .ggtgccgct gcactccagc 18960 
cagggggacagagcaagacccatcccaaaaaaaaaaaaaacaaacaaacaaacaaaaaa 19020 

agagg.caaggc«cag.gaacoatga.tg.gccaatg.ac.ccagcctggg.gacaaag 190S0 

..agaccctg cccaaaaca ataaaaatat aaa«aaaa. aaaacataat agca^cgn 19140 
«a.gagg.gg.atgagcat.aaatgaactga^cgtccc.ggaaaacaguag<gc 19200 

..ggaagga «cgc.gccg coaccgccac cacca^go a.g«caac ctccauacc 19260 
^^ccwcaccatccmgacaggg-ccccagcg^gocmcatcc 19320 

.^^ccttcamcg^agatcaccagccccaagaaccacagtcacaggg 19380 

„^.ccaaa.ctcaaaccagacccgctgg.c.gcae«ccagggacaacagga 19440 

,a«ttcaaaccagcccaaaagagatg.gtggacagcataagaggaacaggagaaactg 19500 

aggcccttg ccc^agaat gagcUggaa gtgga.g.cc cggccKaC caaacc«ca 19560 
ga.gac«aggcccaga^gag««ag.gUcco.caggtcataccc.gagccagaag 19620 

^„aatccac.cc.catcaagac«cc,ccc^maaaacag«gc.gtt 19680 
.caggetgttaagttgtgggc^ttttgttacacagcaatggamoaacacacgaggc 19740- 
aggcaagtg«gagcaaagCgc.caagcoacaag.ctgaca.gtggg«ttgEcC 19800 
gtgmgcag aaatccagcc ao.gag.cc. ccca«cagt cacaogcc CCgcacag 19860 
acaccgcca ca.ccctgcc tgggccagga gc.ccac.ag .gcaggaa« gggtcgccg ^9920 
,^gg3ttcc.gacaccmgcacagggc.agcagcaggcagcac..gg..agtga 19980 

auaaagccccacctttacacagaagggatgrncataagggguattaagu^ 20040 
agcgggaag Catgcgac cagaaggcc maagcaa. mccaacga ggggaaaacc 20100 
etKc«c.ca«.cggccca.«attgagcac«acca«tggaaggccccc«g.g 20160 
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agactgggga atgcaccaat aactgagaca gcttccggct gttgccctca ggatgcctga 20220 
gctgggatag ggccagggtg ggggtggtgc gtgtgacagg gttactgttc acaaccctgc 20280 
cgggccataa gccctcccca acaattccaa aatccaaaac gctctgaaga tggaaagctt 20340 
"gttg^catctggtgacaaaacctcatttggtgcatgggccgggtgcggtggctcacg 20400 
cctgtaatcc cagcactctg ggagccgagg ggaaggatcc cttgagctta ggagtttgag 20460 
accagcctga gcaacatgtg agaccccgtc tctaccaaaa atacaaaaat tagccaggtg 20520 
tggtggcgca ctcctgtagt cccagctact cgggaggctg aggcgggagg atcgcttgag 20580 
cctgggaggt gggggctgca gtgagctgag attatgacat tgcactccag cctgggtgaa 20640 
agagtgagactctgtctcaaaaaaacaaagttaaaaaaaaaaaaactgtgcatgggtgtg 20700 
ggctacagatagtcttttctgccctacttagaatgaacgtgccacatttgctatagaaat 20760 
attcaagggc tggtggcaaa tgccacacag accctgacgc tgttccaagt tctgagaagt 20820 
cctgcattcc tcagggcccc agagtttcag agaagagtct gtaggcctga gttaagaagg 20880 

aacgccttcaaaagccctggggacaaaggggaaaggggtgccccaggactgcgtgggtac 20940 
ctaccggaac gagccgtcca ggttggcacg gtggatgaag ctgagcttgg cgtcagccca 21000 
gtagagcttc tgctcctcca ggtcgatggt cagtccattg ggccagtaaa tgtccgagtc 21060 
cacaatgatc ttccgggtgc tgccatccat ccctgcccgc tcaatccggg gcgtctcacc 21120 
ccagtctgtc cagtacatgt acctgtgacg ggggcagggc aagagaagca gctaacacag 21180 
atctgttttttgtttttgtctgcatagatgcagacatgaaacaac^gaca^^^^ 21240 
cctaaaatctcacccatcggaaataaccaacaggtatggtttcaggtattcxtgccttaa 21300 
gctgggcaatcaaaatatactatttccaacttgttctcagttaacagtaa^^^ 21360 
ccttcccttcttgtggatagaaagattccttgttcttttgatgattgcctagtgtac^^ 21420 
gctgtaagttttttaaagaacttcaggttatttctgatttttttgrtaccatgaaaat^^ 21480 
tgtaaatgaacctctaaaaggcaattcaaaacac^ggatggaatattatttagtggta 21540 
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taaagaaatg agctatcggc tgggcccagt ggctcacacc tctaatccca gcactttggg ' 21600 
aggccaaggc gggtggatca cgaggtcggg agatcaagac catcctggct aacacagtga 21660 
aaccccgtcc ctactaaaaa tacaaaacat tagccaggcg tggtagtgag cacctgtagt 21720 
cccagctact taggaggctg aggcaggaga atcatttgaa cccgggaggg ggaggttgca 21780 
gtgagcagaa atcgcaccat tgcactccat cctgggcgac agagcgagac tccatctcaa 21840 
i^aaaaaaaafl aagaaaagaa aagaaatgat ctatcaagcc atgaaaagac atggaggaaa 21900 
cttaaatgca tgttagtagg tgaaagagcc aatctgtatg agtccagttc taaacactct 21960 
ggaaaaagca aatacacaga gacagtaaag catcagtggt tgccaggagt tggagaggag 22020 
agggatgaat gagtggagca cagaaaatca gggcagtgga actatcctgt atgacatgga 22080 
atggtgggtg catgtcctta ctcatctgtc taaaccaaga atgtacaaat caagggcgaa 22140 
ccctcgtgta aacgtggatt ttgggtgatg gtgcgtcagc cagctttcat cagttgtaac 22200 
aaatgtacca ccctgcacag gatgctgaca gttgggaagg ctgtgtgggt gtgaggacag 22260 
ggatgtatag gaactcagta cctgctgctc atcaattttg ctgtgaacct acaactgttt 22320 
gaaaaaatta agtctattta aaaacaacaa aacatggcca ggcacgatgg cttgcacctg 22380 
taattccagt acttcgggag gctgaggtgg gtgggtcact tgagccaccc tgggcaacat 22440 
ggcaaaatcc cacctctaca aaaaataaaa attaaaaaaa agttagctgg gcatggtggc 22500 
acactcttgt agtcccagct acttgggagg ctgacgtggg aggatccctt cagccctggg 22560 
aggtcgaggc tgcagtgagc tgtgactgta ccactgcact ccagcctgga tgacagagtg 22620 
agaccctgcc taaaaaaaaa aaaaaaaagg ctgggtgcgg tggctcatgc ctgtaattcc 22680 
agcgctttgg gaggccgaga tgggcggatc acgaggtcag gagatcgaga ccatcctggc 22740 
taacacggtg aaaccccgtc tctactaaaa gtacaaaaaa aaaaattagc cgggcatggt 22800 
ggcggacacc tgtagtcaca gctactcggg aggctgaggc aggagaatgg cgtgaacccg 22860 
ggaggcggag cttgcagtga gccaagatca caccaetgca ctotcagcct gggagacagc 22920 
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aacactccgt ctcaaaaaaa aaagaataaa acccatggct gggatggacc ctgaacctgc 22980 
agctgcagct gttcctgggt aggtctgtgg gcgacgtggc tttgcttctc catgttccca 23040 
agagacaagc atcacccatc catgagaaac aagcacatcc tcagggcgcc cttacgtgat .23100 
ctctggccaa tgaaccaaga caaagtgagc agacaccagg tctgggatgg caggtcccac 23160 
ccccaccagt gcccagtgtg ccctgtttgg aggtgaccac agggtgtgtg cccagaggct 23220 
gggcgtgact ctcagcggag accagagggg aaccacacca gcttggagga ctcagttccc 23280 
atcccagcca gctgggatga gccacaggac acaagggctg gcagacctat tgtgttttgt 23340 
ccacccttca cagcagagaa aggggacagt gcccagaatg tcctctgagg agcctcctcc 23400 
cactcttggt ccttgtaaaa tggtgctgac tcccttgctc ccttcttcct ggggtgggcg 23460 
gcaaacccca ttcccctcag ccttagcaag tgatttagaa acaggcagct cgcccaagcc 23520 
aggcatgaga gtgatcccgg gacacaggga gaacaagccc cgctttgccc tctgggggtc 23580 
tccattcagc agaagaggca aatgacagac acacagccgc ctcctccccc accatggtgc 23640 
tctgcagcct caggagcctc aggtgcacca agggccaccc catccagggg gccatgcttc 23700 
cttgagtggt atcgttcctg agcgagtacc atctccacct tccagagggg ctgtgacaag 23760 
atcaacaaga atgagggcat aggagcctcg aaccaaacat gccctcttcc ctgcagaggc 23820 
tgactgcgcc cagctgctat caccaagccc ctgctcctcc ggccccgtgg ggacagggta 23880 
ag^ggggtgt cacatggaac agctctccaa acagtccctc tcaagctgct gtctcctgtg 23940 
catctagtga gaacccaacc aacaaaggga aggtgggaat tgctattccc attaggcaga 24000 
tgagaaaact gaggccccga aaggctggcc tgttccaggt tacaggcgct gagcggctgc 24060 
tctgggaaca cacttggtgt ctgctgaggg cccgagcccg gccatcatat gactcaccct 24120 
tcgccagcaa agcccgggtg tgggtgaact tttcctggca gcctgggact ccaaggtgct 24180 
ggcagccagc ccagggaagg ctcccgcgtg cctgcggcag acgccttgct ttacctgcac 24240 
gtccccaccc ctaggagcct ggacagagcc cagaccctcc gccacctcct gagaaggtat 24300 
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caggggcatc agtctggact tgggggggaa tccacacagg ccttccccaa atgctccacc 24360 
gtggcccatg gaaaaggctg gaaaacgtgc aggagcagga gcctccgcat ggagcataat 24420 
tcacattcct tccccgagtt tcataacaga ggcctgctgg tttccttaaa tggggaattt 24480 
gcgagccagt cggtgaccag agactggttg gcgtggacgt gctcttgcag agtctcaaac 24540 
gctaccacaa gcccagccaa attocacgga ggaaaatcga cttccgaaga aaagagctgc 24600 
agcatggccttcgtgcagagccagctgcggttgtggttgtgtgttattttagggaagggc 24660 

cattttgcat tttaaagagg gggttgggtt tcaccctggc tttaatttga gacccggggg 24720 
ccactgcagc cccttgtcag gctggtacag gccggggact cctcccatgc taagccagtg 24780 
tctttctggc cccagatcct caggggccag agggtcatcc ccagagcccg ctctgccacc 24840 
cacatgggta ccctgggcct gggagggatg tgccttccct caaccctgcc tggatgtccg 24900 
cacggggcca cctgcattgc tgaaactgea acgaagtcga gtctcaggag gggcccccct 24960 
ggctgcaggg ctcttgatcc ttttggccac gtgcacactg aggtggacgc tcggacccag 25020 
agaccccctt catgatgatg gccggggcag gaaccccctc ctctgaggaa ggaccctggt 25080 
gggggacagc actgcaggag ggcacaggag atgacggggg ctctagcagg gccgggagga 2 
aggccaagat gctcctcgca accgtgtgcc tgtggccagg acagaggaca aacccaccct 25200 
ccactgtccc cactctcagg acagcagtcc tgccccagga ctcagcgccc acacttatgc 25260 
ctgaggacca ctattcaagt cagtatttgg cgagcagggg ttgctgccgc gggcgctgtg 25320 
acaggctgga atcctctccc tctccctctc cctctccgga gacatggagc ctacagggac 25380 
agagtcagca cctcagggta ggaccatggc tggcgtcatc agcatcactg gatctgatga 25440 
gtgggagccg gcatctcact gttttcactc tctcattcaa atgactggag caaagggaag 25500 
gtgtggggag aggcccagga atcaacacta aggtcaactt tgcccccagg ggcaggggtg 25560 
ggagtgaaca gccacaggtg tgatcctggg gagggcttct gggagagaat tcagaggcaa 25620 
gcatgtagag gaaccatttc aaatagttaa gaaaagccag agccaaacag ggacagttgg 25680 
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ctcgcagaga tgatgcaggc aaagccagct cagatctgag catgggaaag actactccca 25740 
accaagggcc cagcatctcc caaccaagca ccaagtacct cccaaccaaa tgccaagcac 25800 
ctcccaatca aatacctccc aaccaagcac ctagcacctc tcaactggac accaactact 25860 
cccaaccagg caccaagtac ctcccaacca agtgccaagc acctcccaac caagtaccaa 25920 
ttacctccca accaagcgcc tagcacctcc caactgagca tcatgcacct cccaacagag 25980 
catctagcac ctcccaactg atcacctccc aacctagcac cgagcacctc ccaaccaagt 26040 
gcagagcacc tcccaaccaa gtgccaagca cctcccaatc aaatacctcc caaccaagca 26100 
cctagcacct ctcaactgga caccaacaac tcccaaccaa gcgccaagca ccfcctaaca 26160 
aagtaccaat caccttccaa ccgagcacct agcacctccc aactgagcat catgcacctc 26220 
ccaacaaatc acctagcacc tcccgactga tcacctccca acctagcact gagcacttcc 26280 
caaccaacat agcaaaagcc ataaagaagt aaaaagacaa aaccacgtag gcatggag^ 26340 
tggacttctg gtggcgagga aagggcattt ttattataac gacagctaac atttgttgaa 26400 
ctcacaaact gttcttggtg ttttcctcat gacatgcagc atggtcacgc ctctgtacag 26460 
acaaggatac tgaggcacag agtggcaccg tgccaacctt gtctcatctt tttatcgaac 26520 
ctacatgcag agtgccagca aatccagctg tcttttctct tcagaacaga tcccaaatct 26580 
cgccactcct tacccccaca agtgaggtgt ccccgctgct gctttctgtc gccaggatcc 26640 
cggtaataac cgtggagagg gctcctgccc ccacgccacc caccccacag ctcactctcg 26700 
ctccagccac caggggatgc cttccagcac gagtcagagc tggcacctcc tctgctcgag 26760 
acctcatgtg tcctctcctc acaccttggg ccctgtttcc ctacattctg ctacagcccc 26820 
tcaaacaggc cccgccccaa accagcccag ggcctttgca ctggctgatc cctctgcctg 26880 
gaccgcgctg cccccagaca gccacacggt tctcagcctc atctgcttcc agtctcgact 26940 
caaaagtcac caagaggcct tcccagcacc tgagctccga cggaagcccc tcgccacagc 27000 
acccaagcac tgctttatcc ccctacgcac acgtcccttt caaatactat tcatttacca 27060 
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tctcctccca ctcactgaaa gggccagaga ctgggctata cccgctgcgt ggggagcagg 27120 
accaggcgca agggctcaca aatgcagtgg atgcctggtt gggaggtgag ggagctgcag 27180 
cgacccacgc tgggagggaa cgcaatgaca ggaggagcgc aggtcctggc gacacgatgg 27240 
ccatggcagc cgctggtgag caaccgcagg ccggccctgg gagagggctt ctagcaagct 27300 
gctatcttca gcctctccga ctactgcaga tgccccctcc tagccagaga cactgctaca 27360 
ccagccgacc cttccaaaaa gaaggtcagt aaccccgcga ctcctggagc cacagtgcag 27420 
ggggagaggg ctgagagggc aacagttcac caagcggaac agaggctgcc ccggaggtca 27480 
gctggctccc cggcagctgc aggggtggct agcccactcg gagggcagcg agggcatacg 27540 
aggggctcca gggatgagtg gttgcccagc acagcacccc tgggaggccg ggggcacttc 27600 
tcaggtagtg ggggcacgag gctgctctgg cctgacctca gggactcaaa atactttggc 27660 
gataaattcc accgtgtccc acccctgctg gtaccccata cttacacaca gactggttca 27720 
gatgcagaca ctctcgcgca catactcgct cacacgggca catacacgtg cacacacagt 27780 
cacatgcgca cactcataca cacacaaata tccactcaca cgcatgcatg cacacacacg 27840 
gacacacaca ggctcacacg tatgcacgca tatgcgtgca cacgcacaca cacacacaca 27900 
cgctcacatc ctcccactcc cacactcagt tgctcagaca cacacacgcc tggctctcac 27960 
acaaacctgt tgggctctga aaggctccag cccttcccat gctcgtcaga agccagtcaa 28020 
tggcttccta agtcaccaca cagatcaaag aggtgaactt ggccacatgg cactctgctt 28080 
cctgagctcc caaacaccag ccttggtgag gacagaccct caccccacac cctcattccc 28140 
actaccctgg gcaggcccag aggaggggca tctgcaggat ctggcaacca gcccctcccg 28200 
cccggctcct gcagccggca ccatgggagt cagggggagg tcactgcaaa gggcaacagc 28260 
aagttggtgg ccccaggact agagcccagg ggtcttcagt cctactccag agcttggaca 28320 
ctgtcccaca gggcatggcc aagggaaggg cttccagagc cctgacttca gggaggaggg 28380 
caggcgggct cctgtggcag gcctggatgc atggccgccc actcctggga ctttctaacc 28440 
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tagaatatct aggtcaggct gggtgcagtg gctcacgcct gcaatcccaa cactttggga 28500 
ggccgaggag ggtggatcac ttgaggttag gagtttgaga ccagcctggc caacatggcg 28560 
aaaccctgtg tctactaaaa atacaaaacc tagccaggtg tggtagtgca cgcctgtaat 28620 
cacagctact caggaggctg aggcaggaga atcacttgaa ctcgggaggt ggaggttgca 28680 
gtgagctgag atcgtgccat tgcgcaaaga agatctaggc cggcccctca accggtgagg 28740 
tccaggctgg gagtgctgag agactgtggt gacactgaat gaactaacag gcaaagggct 28800 
tccaactgag cctgggggtg gtgggaaatg gctcttgtgt tctagtcaag acctctgcca 28860 
accagttctg acactgaccc agcacagaac ctgacaggtc agcaagggcc agggcttagc 28920 
acagcccagg taagggtgtg tgtacggccc ccagagtcac tcccaggctg caagaaaagg 28980 
gacaaaggag ggacaagggg tggccaagca aactgttccc tctgctcggg agtctggggt 29040 
gacctggcct agctggccag tggagctggg ccacctcccc ttaaactctc caccccggac 29100 
ttcgactcca aagctttcct gccacccacg ctctccccac ctgggatcac ggccaggccc 29160 
tgagccttca agggcccagg tgaactcagc cagactagga gctgaggagg acacagggca 29220 
gcttccagaa cggacccgag aaccactecc agcaggttct gcttccagac aaggagctgc 29280 
actttttcag ccaatgcaat tagaaagcca ggagaaggtg caaattccac ctgcctgagc 29340 
gtccgcactt cccaggccgc ccaccataca cacagcaaag atgtgtttaa ccattcaaac 29400 
ccatggccaa ccacatcggt tgcctcagac atgcaagttt taaaaaggaa cataactatg 29460 
ggccaggcac ggtggttcac gtctgtaatc ccagcacttt gggaggccga ggtgggtgga 29520 
teacctgagg tcaggagttc gagaccagcc tagacaccat ggtgaaaccc catctgtacc 29580 
aaaactacaaaaattagctgggcgtggtggtgggcgcctgtaatcccagctacttgggaa 29640 
gctgaggcag gagaatcact tgaacccggg aggcgaaggt tgcagtgagc cgagattgtg 29700 
ccactgcact ccagcctggg caacaaggga gactccatct caattaaaaa aaaaaaooog 29760 
aaaaaggaacataactatggagtctcaaggggaagtaattccttcaacaataacaaatct 29820 
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tgaaagctga gctctttttt ttttttgaga caggatctcc tcactttgtc gcccaggctg 29880 
gagtgcagtg gtgggatcac agctcactgc agcctcgatc tcccaggctc aaatgatcct 29940 
cctacctcag cctcccaaga agctgggatt acaggtgcat accatcacac ccgattcatt 30000 
tttgtatact ttgaagagat ggggtctcac catgttgccc agtgtggtct tgaattcctg 30060 
gactcaggtg atctgcccgc cttggcctcc eagagtgctg ggattacagg cctgagccaa 30120 
cacccccacg ggttcatttt cagagtcgca ccgagtgctg gggttacagg cctgagccaa 30180 
cccccccacg ggttcatttt aagagtgaca ccgagtgctg gggttacagg cctgaaccaa 30240 
cccccccacg agttcatttt cagagtcgca ccgagtgctg gggttacagg cctgagccaa 30300 
cccccccacg ggttcatttt aagagtgaca ccgagtgctg gggttacagg cctgagccaa 30360 
cacccccacg ggttcatttt cagagtcaca ccgagtgctg gggttacagg cctgagccaa 30420 
cccccccacg ggttcatttt cagagtcaca ccctttttct gaaaaacaac ttgggctcat 30480 
gcaaattcga gagagagatg gtgacactcc ccgccccctg gacccaggtg gagtcgcagc 30540 
agggtttacc cgtgagcggg gtccaaggcg atggccctcg gctggtcaag gtcctgccag 30600 
aagagcacct tccgggatgt gccattgagg ttggccacct cgatgcggtt ggtctctgag 30660 
tccgtccagt acagcttctt gcccacccag tcgcaggcga ggccgtcggg agagaccagg 30720 
ccggagatga ccacgttctg cacggcggcc cccgtctggt tcaggtaggt ctgcttgatg 30780 
gcctcctcgc tcacgtctgt ccagtacacg gctcccttgg aaaactggaa gtccactgcg 30840 
gccgcatcct ccaggccgct gaccacgatg gtggactcca gcttgactcc gccggcgtcc 30900 
accagccgta cgtcccggcg gttggcaaat agcaggagcg gcgaggctgt ggggcagaag 30960 
caaaccgtga gggccactgg ctaagccagc aagatacaca gccctgggat ggagcactat 31020 
gcccagagca ctcctggtac tgccctgccc atgcccaaga cctccagttc cttcctccca 31080 
cccctaaggc gttgtcagga agttgcctgg gcagccccgg cccgcatcat tcagaggctc 31140 
ctgcagcgca gcaaacagcc ttcttcccac attcggtgac agcacctgtt tgtttaccaa 31200 
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ctgttacgtc tgttccccca gatatgggtg acccttcctg ccatgcccaa aacctcccac 3 1260 
atcgtcctcc agaggctaca ggggccctgt cctgttctgc agagaagcca catccccttt 3 1320 
gttggcctga cacaggggat ggggacatgc aggcacagca ctggccatgc tgctcgctac 3 1380 
agacccagcc acagggccac attttttgag gggttcagag cccaggccag acagagcctc 3 1440 
aagattccct tacaagtctt tgaccactgt ccaagctcag gcccgtttcc ttggccgtgg 31500 
catcagcttc ccatccaccc ctgtattcca tgtttctccc accctgcttc tggacattcc 31560 
tacatttaaa gggtcactct ggaatgccac cccttggctc agacaccttc cacagctccc 31620 
tgtgccagtg ccatgcagaa caaggtcaga ccccctagcc tggcctccaa ggccttggcc 31680 
tctggcctca cctacacttc tctccaccac cccaccccaa gcattcctga tctgcctgcg 31740 
gccaggctgg ctC5CCtcacc tccctgtgca ccgcagccct cagccccttc tgcctgtgca 3 1800 
agaagcctca tctcacagac aacggtctoa ttcccacaac gggctcaatg agaaatcagg 3 1860 
agaggccttc agaccatcac cccaccagac acctcagacg tcggaccagg agggtccagc 3 1920 
aacccccaac acagactcag agggactaag aagccacatg aggagtgaac acaagatgtg 31980 
gacaggagga ggttaagggc ctccagggag ctccatcagt ccgtgttctg ctgtcagcag 32040 
ggttaggctg ggctggccac aaacaccccc aaaaaacatc tgaagccttg gcttgaaaca 32100 
gctgacattc ctcatgaaaa ctgcagaccc ctgggtcctc ctgcgcagat gggggagccc 32160 
agccaacccc acactcccac cttcaccaag aaagagaaag ccaaaacaaa ctcaactcag 32220 
ccaatgacaa tcacagaact gaatcctgta gttagttcag ttggtttcat ttcagcaggg 32280 
gaaagatttg cagcctctat gagggtagct gggaacacaa agggccagag catggcccag 32340 
gagaccccag cgcagtgggg tagatggttc cgagcacagg cctccctgcc aagacaagca 32400 
ctggctcaaa tcctggcccc tcccattccc aggagacatg ctccacagga tgggaggaca 32460 
cacagaggac ctgaggccag gaaaatgaca gcggcgcctc cgccgcccca cccgtgctgt 32520 
catcatctta ggtctacagt tctttgtggc aacgagggac actgtgaaag tcaaacaaca 32580 
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ggaaggcata ggccacaaat aaagacaaac gggacttcat gggaagctaa agattttgtg 32640 
catcaaaaga cactatcgag agagtaaaaa ggcaacccac agaatgagag aaaatatttc 32700 
caaatcatag atctactaag agattaatat ccatgaaata cagagaactc ctaaaactca 32760 
acaatgagaa aacaactaag ccaactcaaa aatgggcaaa caacttgaac agacatttct 32820 
ccaaagatga catataaatg gccaataaac acatcaaaac aggcttaata tatccctaat 32880 
catcagggaa atgcaaatca aaactacaat aagataccat cttgcaccaa ttaggacggc 32940 
tactatcaaa aaaacaaaat agcaagtgtt ggtgaggatc tggagcaact ggaacccttg 33000 
tgcaccactg gcaaaaatgt gaaatggtgc agctactatg gaaaacagca tggcagttcc 33060 
ccaaaaactt aaacacagaa ttaccatatg acccagcaat ttcgctttgg gttatatacc 33120 
caaaagaact gaaaacaggg acacaatcag atatgcatac accttggatc acagcagcat 33180 
ccttcccaac agctaaaaca tggaggqagc caggcatggt ggctcacgcc tgtaatccca 33240 
gcactttggg aggctgaggc gggtggatca cctgaggtca ggagttcgag accagcctgg 33300 
ccaacatggt gaaaccccgt ctctactaaa atacaaaaat tagctgggcg tagtgacggg 33360 
cacctgtaat cccagctact cacaagtctg aggcaggaga atcacttgaa ccctggaagt 33420 
ggacgttgca gtgagccaag attgcgccac tgcattccag cctgggtgac acagcgagac 33480 
tctgtctcaa aaaacagcaa aacaaaaaca aaaaaacaaa caaacatgga agcaacccaa 33540 
gcgtccctct actgagggat gaatagcggg gcaaaatctg ctccatccac acaatggagt 33600 
actattcagt ctcaaaaagg aaaaagattc tggtcaggca cggtggctca tgcctgtaat 33660 
cccagcactt ggggaggctg aggcgggtgg atcacctgaa gtcaggaatt caaggcccgc 33720 
ctggccaaga ctggcaccna gctacacana aagtatangg ccccggaaa 33769 



<210> 9 
<2il> 72049 
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<212> DNA 

< 213 > Homo sapiens 

<220> 

<221 > unsure 

<222> (8356),(8385),(38585) 

< 223 > Identity of nucleotide sequences at the above locations are tinlcnown. 



<400> 9 

tataccttgc gcggaccttc ggctcctgtg gtgaagacaa tatgaagaaa atagaaatta 60 
cccataattt tgccacacag acttagttgt gtccatgtat cttgtgcacc ttttttctgt 120 
ttacggatca aaatcgactt ttagggtcag gcgcggtggc tcacacctgt aatcccaaca 180 
ctttgggagg ctggagttgg ggttgggggg tggatcactg aagatcagga gtttgagacc 240 
agcctggcca acatggcgaa actccatctc tactaaaaat aaaagattag ccaggcgtgg 300 
tggtgggtgc ctctaatccc agctactccg gaggctgagg caggagaatc gcttgaaccc 360 
aggagacaga ggttgcagtg agccaggatc acgccactgc actccagcct ggcaacagag 420 
cgagactctg tctcaaaaaa aaaaataaaa ataaaataaa taaatacata aattgacttt 480 
taggagattg gttcaaacaa tgtgtgtaat gttgtgtctg agtgtttttc atttatcgtt 540 
catgcaaatt ccgacatcat tcactcttct ccagagtgtg ctgtmcct gcctgtgtca 600 
tcacccgtca ccttgaatgc cctcgtttag gtaaaataag tacattttat tcaaaaatat 660 
ttgaggacat ttgggttgtc tccaggttct tggtcttgag ttttgctgtt cttgtggagc 720 
catggtggtg tctggttgca ggaacctcca tgcgttccag ctgctgcttc tgcctgtgtt 780 
cttagagagg aaatgctggg gtccgcggtt cccgggctgc tgaccaggaa gcctgcggtg 840 
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ctttacggcc cttccagaag cgggagatgc ccccacttaa gtgtcagaca ggcctttcca 900 
cctcactggc agctctgagc ggctcccttc tatttgcaga tgactgagaa gttacx:aatt 960 
tccacgttta ctgactgctg tttctcctgt taatttgtat ttatagtctt cgctaattta 1020 
ttgctagggt tttggtgttg tccctattga cttgtatgcc ttttaatttt ttaaacaaca 1080 
ttaatatact tcattttttt agagcagttt taagtttaca ggaaaattaa gggacaagta 1 140 
cagagagttc cttccacctg ctgtcctcct ctcctcctcc ccaccttccc tccttcccct 1200 
attgtaactt tctttctgat attataaaag tcactcatgg ctgggcgtgg tggctcacgc 1260 
ctgtaatccc agcacgttgg gaggcagagg caggcagatc acctgaggte aggagttcca 1320 
gaccagcctg gccaacatgg tgaaaccccg tctctactaa aaacacaaaa agttagccag 1380 
gcgtggtggc gggcacctgt aatcccagct actcaggagg ctgaggcagg agaatggcgt 1440 
gaacctggga ggcagaggtt acagtgagtc gagatcgcgc cactgcactc cagcctgggc 1500 
aataagagtg aagcttcgtc tcaaaaacaa agtcacacac gcttcttgta cgagggtcat 1560 
ttggccgagg ggccagatgg ctcaccatct agttgggaca ggccatgagc tcggaatgct 1620 
ttttacatat ttacatggtt gagaagaaaa tcaggagaat aatgttttgg gacatgggaa 1680 
aatgacatgg aatttgcatt ttagtgtcca taaatgaagt tttgtttgct cccagctgtg 1740 
ttgactgagg caggctggct tcctacagct gcggcagagc tgaggaggcg ggaaggagac 1 800 
cgtgcaggcc gcagcaccga aaatatttgc tctctggccc ttcccagagt gcttgccgac 1860 
ctctgtccga cagctagaag gaaggatagg acccgtccga cgataaccac tgttgacatt 1920 
tgagcgcgtt tccttcccgg cttttgtgtg agagtggcag tctgtttgct tttgtggtcg 1980 
ggatctgctg cacgcacggc gggctgtttg catgaggctt cctggaggat agggctgggc 2040 
tcggagctgc acgcagtggg gcgtgtcctg catgcagtgg ggcctcagaa gagagctgtg 2100 
gtgggcgggg cagtgccaac gctggtgggt gccaggcctc cacgctcaga tcagccccgg 2160 
cgacaggttt gggccaccxt ctctctggcc tctgtgcagt ggcccaggcc gtctgctctg 2220 
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cctggcacac ttgcctctgt ccttccactg aagcgctcct cttaccctct gctcccggct 2280 
gggtacgttg aattgtgtcc ctcaaggaga tatgctaaag gtctaacccc aggaacctgt 2340 
gtatgtgatc taatttggaa acagggtctt ggctgatgta atcaagcgag gatgaggtca 2400 
ccctagagta gggggcctat atccacggtg ctggtgtcct catgagagca ggtgagcaga 2460 
cactgacact caggggtgaa ggctgcatgg agtcagaaca gggcttagtg cgatggcggc 2520 
cacaagccaa ggaactccaa gtatttcctg caacaccaga agctggaaga tgccaggaag 2580 
gatcctgccc tggagccttc ggagggagtc tgtccctgca gacgtcttga cttttgattg 2640 
cagggatgca tgtcttaggg tgtgtggggg ggtgcatttc tgatgttaga agccacctgg 2700 
ttggtggcga tgtgtcacgg gagccctctg caggttctgc gtgtx:catgt ggtcggggac 2760 
agaggtgggc agggacggac ggtgtcgagc tggacatgtc catgacgtcg gccatccctt 2820 
gggatggctt ttttgttttg aggataaggc tgcctgccag gaagctgtgc cctgcctggc 2880 
ccttgcccca agcccctggc ctgtgcttgg cctcgcggaa gggatgtcgc ccttctctcc 2940 
tgcatgcgtg cagggaggaa ggggagaggt cagcagcxcg cctggaggag gctcgggcga 3000 
ggggaaggtt tcactttcag gcaatgttgt ggggctgttt aaacaacccc aaagaaaacc 3060 
atttggccaa actgttagtt tccaaacatt ttacttcctt ggtgtttaaa taaattccta 3120 
ccaagactct gtagctggtc ccagggaagg agttggcctc tcttctttat agcccggcac 3180 
agtcagtccc ctgcacctgc ccctcccaac cccaggcctg cttccccgtg gccatggctg 3240 
ctgcccggac ctctctacac acagaacctc ctggaggcca gctgtgggca ccagccttgg 3300 - 
cagggctgtg gcggagccca ggctgctggt actctctctg cagctgctcc ctgctggcct 3360 
ggctggacag cgtccccacc accactgggg tcacctctgt gctggtcaca gctcactcag 3420 
accttcaggc aaatgggttg gatcctgcct ctctcccagg tgtctcagtc tctgcaaaac 3480 
tcaaaaacct cagaggcctt gcagcctgag gggtgtcaga gacacctcct tcgaatcagt 3540 
aaacacctac agattcaccc cagcagtgaa aggactgctt cgccacagag gtttgattta 3600 
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ctcctaagta attggaaggg atgccgagaa taggttcctc atggtgggac tagaggccct 3660 
ctgctgacct agttaacaga gggctagggc tgggtgtgct cagcccctga aggttctagg 3720 
cccatttggg acaccccgcc agaacctgcc acaacctgcc atgtggtgac agctacctaa 3780 
atcccagagg ctcttgagct ggagagcaga cctctcaatc tcagcaggcc ccccacacag 3840 
accccataac cctagtctgc cttcacagta cagttcgtgg ctatgtgttc acggatggtg 3900 
ttgttcacct aaggtctctg ccctgtgacc ccaagggcgt cctgagggca gattccaagt 3960 
ctgtttcgtc cacccctcct tccctagcag cgggtccagg gcctggcctg aactagcttc 4020 
ccacagagat actggtggga tgatgaaggc agccaggcgg caagtgaaaa acgcacttcc 4080 
tgcatgtgct ggctcctggg attgaagtgt ttgaggaagc aaagtgaagt gagctttcct 4140 
cttgcggctg tgtgtccttg ggccgggagc ctaccctctc tgagcgttgg ggtccttgtc 4200 
agtagaatgg ggcatcctca tagctcaagg ggtggtgtgt gaaaattgtg ctattgtgtt 4260 
actttaatga tttttttttt ttcgagacaa agtctcaccc caacgcgcag gctggagtgc 4320 
agtggcgcga tctcagctca ttgcaacctc tgcctcctgg gttcaagtga ttctcctgcc 4380 
tcagcctccc aagtagctgg aattacagga gtgcgccacc aggcccggca tatttttcta 4440 
tttttagtag agagggggtt ttaccatgtt ggctaggctg gtcttgaact cctgacctca 4500 
ggtgatccac ctgcctcggc ctcccaaagt gctgggatta caagcatgag ccaccgcgcc 4560 
cggcctactt tagtgatttc ttaggaggac agagggaacg ggctggcaag acaggcttgg 4620 
aatgtgtttt gggatcaagt gccggtttct gtctggcact ggcgttctct gtggggccat 4680 
gatggacaca ctgctgaggt caagcgtgat tcgtcttgcg ctgtgcctgg cagtctcatt 4740 
ggaaagttct gtagacatcg tgtggatggg gctcttcccg gccaagccct tggggacctt 4800 
ccaggactgt gatctcccca cagtggctgt taagcaggga cctttcgtga agtggagtct 4860 
ctggtcccct ccaagtcata gctagacagg gactcgggca tcgccaagcc tggctgatta 4920 
ttcactggat gaggagacag gcccagagag gggcaggaac ctgcccgagg tcacccagca 4980 
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ggccccagag gtttcggtct cggattctcc ctgctcatcc ctggatgtag tgctgctgtg 5040 
gatgtggttc tgtgctgggg gctgtggaga gcagggggct tgtgccagga ccccagtgag 5100 
ggtggcgccc tcgccatgag gccgactgtt ggtatggggc ggccatccac tggggtgtgg 5 160 
ggaggaacag ctttcctgag gaggaggtgg cgggaggaac agcttccctg aggaggaggt 5220 
ggcggtgctg tgtgacctgg gccttgaagg acaggtccat tgtcaacaga acattttggg 5280 
agtggagcct agagggagaa aatttgttga aattcagatt cccctccccc taccaataca 5340 
caccaaatca gatgcccctg accagatcta aatttggctc tcagagattt ccattgtagc 5400 
tgggcacttg gggaaccttc taagtgctgc ctctgcctct ccccagcctg cctgcctcag 5460 
tttccccagc cctgggcccg tgtcgctgtt gccatcacgt gggcgccctc tagtggagga 5520 
atcagattat gcactccggg gcttggagca ggagtcagga ggggctcctg tctttccttg 5580 
aaacgttgga tgccgggatc ctggaacagt ctctgcattc ctcctggcga gaaccagagc 5640 
ctgggcacag gggaccatct gttgtttgaa ggctgcagcc tggcagggca ctcaggagat 5700 
ctggcagttg gctgcagggc caggtctagg ggccagggca tcagggaggc tctgggctgg 5760 
ttcagccccg ggcccctttg cagattgtga cctgggcccc tgtgcagggg catggccaca 5820 
ggatgctggg aggggtctct gaccctgacc ttcttggctc tgtgcatcct tgagaccaga 5880 
aaggtctgga acaaatgagt agacgatgcc ctaacctggg gagggagcca catcctgatc 5940 
ccagcaacct cgggaaggat ctgtcaggat tatggggcac cctgggggcc ccaagtctgc 6000 
atgggtctcc acttgcaatt tctgtaggaa gctctgataa atccaaactg ggggtcctag 6060 
gacacagtca gaaatgctga taccgttgtg tgtggagcct cgggccctgg gggtcaggag 6120 
catgtggagg gtgggccacg ggggttcaga agagaatcct gtaacccccc accccccaaa 6180 
ctgaagccca cttgagggcc atggctgaaa ggttgggggg tctccgtgcg tcctgtggag 6240 
tgggtggtga ggagtccttg ggtttgcacg cctctgggcc tgagcggcgg gaccccgtcc 6300 
acagcggatc cctgggccct gttgctcaga tgctctcaga^gtgttgctgt ggccacggag 6360 
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ggagcctgag tuagcnct cttgtgccgs ttgtacgctg tcaggttaca cggtgagtt 6420 
aggcagggoa cagatgccca gagcagaggg aactt^« ggSgattcaa cacgtgcaag 64S0 
.cnaggggc ^gcaaatcc tgccccagc tagagagggg gcMtatt. gagaccagaa 6540 
«acc«agc atcctcctg. occcagctg. gKcagcCg Ktgcaggga ca.cc.gaga 6600 
ggaccaggct accccca. ccacCgcc. aagtgccac. c«aaccc.g .ccao«gtg 6660 
ccg.ggaggg gcgtsaccc aagCgCca gccagcagca ggcttggccc .ggggSgcag 6720 
cagagaccca gg.ggc<g.g ggM8S«g- "^tggcg. gg«cgaaa cttcgttgga 6780 
ag,g,g.ggacag«ccttgcc.gttcK.g.gggaccc.amagaaacgaggtc.gag 6840 

mctggggg .ca.caag. gttctga.gg cccagag.g .ggaggccgc gg.gcagccc 6900 
ca,ccaaggagccagggccc,ggg«agccg.gaccagaa.gca.gcccoggagg.gn 6960 
.c.ca«*gcacc.g«ttgcc.gg.g«tcaagtggtcg«aaac«gWUgcto 7020 
„gg.g»cc«aaag.gcccccgggtc«aggcc.cagaaccaggg»ccc.Ka.c. 7080 

cgg.ggcc.g ggagca.c.g ggcagagag caaagagggc gattcacttg aagga.g.g. 7140 
c^gccctgc caggagccc cccggcacgg .gcggggcc «aagc«cc c,cggg.gg. 7200 
ggagaggagg gagcga.gaa g.ggcg«ga gaggi=agg aagggtgagc ccctgcaagg 7260 
.gggca.gC ggggacgCg agcagca^g c^gcagcg ggtctgcagc c.gg.acccg 7320 

gcgggac^g .gg«ggggc .ggt^gtgg -aggagagg ggctggcagg agacaagggg 7380 
gac.g.gaggcag«cccacccagcag«gaagcccaa.ggcc.ggc<g.g.ggct«ca 7440 

gctgcg«camccKtoag.gcttcag.Wctcamgoaaa.gaggaaacaaaca 7500 
gtgccagcc. cccagagg.g K^aigagga. gaacgag.ga ccatgmgca tgggCggg. 7560 
gcgtgttacc oacaBacc agccmgca aggagagocc «ggggcc.g gctgagtatt 7620 
^ccc ggcccacccc aggccugac ttg.gcc.gc «caggccc. .gacocc«a 7680 
ecccattgcacc.g.c.ccacaggagccgaggagg«c«agc.ggcocggcggacgga 7740 
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cctacggagg atctcgctgg acacgccgga cttcaccgac atcgtgctgc aggtggacga 7800 

catccggcacgccattgccatcgactacgacccgctagagggctatgtctactggacaga 7860 
tgacgaggtg cgggccatcc gcagggcgta cctggacggg tctggggcgc agacgctggt 7920 
caacaccgag atcaacgacc ccgatggcat cgcggtcgac tgggtggccc gaaacctcta 7980 
ctggaccgac acgggcacgg accgcatcga ggtgacgcgc ctcaacggca cctcccgcaa. 8040 
gatcctggtg tcggaggacc tggacgagcc ccgagccatc gcactgcacc ccgtgatggg 8100 
gtaagacggg cgggggctgg ggcctggagc cagggccagg ccaagcacag gcgagaggga 8160 
gattgacctg gacctgtcat tctgggacac tgtcttgcat cagaacccgg aggagggctt 8220 
gttaaaacac cggcagctgg gccccacccc cagagcggtg attcaggagc tccagggcgg 8280 
ggctgaagac ttgggtttct aacaagcacc ccagtggtcc ggtgctgctg ctgggtccat 8340 
gcgtagaaagccctgBaaactggagggagccctttgtccccctgncttcagtttcctcat 8400 
ctgtagaatggaacggtccatctgggtgatttccaggatgacagtagtgacagtaagggc 8460 
agcctctgtg acactgacca cagtacaggc caggcctctt tttttctttt tttttttgag 8520 

atggagtctcactctgtcgcccaggctggagtgcagtggtgtgatctcagctcactaca^ 8580 
cctctgcctc ctgggctcaa gtgattctcc tgcctcagcc tcctgagtag ctgggattac 8640 

aggtgcctgccactgtgcttggctaatgtttgtatttttggtagagatggggtttcaccg 8700 
tcttggccag gctggtcgca aactcctgac ctcaggtgat ccacctgcct cagcctccca 8760 

aagtgctgggattacaggcatgagccaccacgcccggtcaggccaggcctcttttgaaca 8820 
ctttgcacac catgggtctt ttcatccagg ggggtaggta cagttgtaca gttgaggaca 8880 

ctgaagcccagagaggctcagggacttgcccagggtcacacagcaggatgtggcaggtgt 8940 
ggggctgggc ctggcagcgt ggctccagct ttccagcata gaaatctgtg aaagcagata 9000 

gtttgtcggtcggtaggggagactttctgagacccgcxccagcggctcagagggtagta^ 9060 
ccaggggcct tcctgggggc tcataaccca gaacactgaa tgggaaaacc ctgatggagg 9120 
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aggogcag.ggagctg.gggtgccga.gggaagtcccagaggagctgggaggtcagUgc 9180 
gg.gc^cccWgcactagtgggcac^gM'E^-S^ttcatggccct 9240 
gggacctgaagacagaagstgaagtaacttgcccagggoacccgtcgggcagcggcggg 9300 

cagaggatog.gggctg«gagccgtgc.cg«gcccagecc«gggg«gtgagw 9360 
gctggccggggagc»tcctgcaagtggactgg.g.«ggagccagca.gtcaggoag 9420 

ca^cagcgg gagtgcagca ggcagcggga gcacagcagg cagagggogg ggCcgagca 9480 
gccatccgtg gaccc^ggg cacggaggca .gtgggapg ggCgcKca tggcagtggc 9540 
tgaagggag ggngtg-c cgaggaggg. ggatgagggt aagaagtggg gtccccaggg 9600 
gcmagcaa gaggaggccc aggaactgg. tgccagctac agtgaaggga acacggccct 9660 
.gagg.caggagcttggtcaagtcac.gK.acatggg^cgg.gtcctca.c.gtgaaa 9720 
aaggaagggamgaagc.gac.ccaaggcccc.cc.agccc.ggt«ca«ag«ga 9780 
gga««agggacatgggct.ggoagtctgacagCgagg<cg.ggggtccagggagggg 9840 

caccgagctg gaagcgggag gcagaggggc .ggccgg«g ggt-gacac agctgaagca 9900 
gaggctgtga«.ggggcc.cagaaccttcacccc.gagctgccaccccagga.ctgggt 9960 
tccc.cct.ggggggccccagggaaoaagtcacctg.co.ttgcataggggagccottca ,10020 
gcOtg.gcagaaggttotgcWgccccttc««:ctctaggtgc«:agctcc,ccagc 10080 
cca«agtcagatgtgaggc.gccccagac^cagggt.afflCg.ccactgacc 10140 
mgggatgggagatgagc.«tggcccc.gagag.ccaagggaggtg.gg<gaaaooc 102(X) 
gcacaggpg gaagtgggca .cccgtcco aggggagccc ccagggactc tggtcaagg 10260 

gcttgccgct ggcatgct^ IX^S- '^^^ ^"^''^ ■° 

acamaoaaacaccgacat^accgaoaccgacamaccga^ctgacatttaccaac 10380 

artgttmceaacattgacatc.aclga.a«ggoatctaccaaca«gacamaccga 10440 
cactgacatttaccaacaaatttaccaaca.tgacatotactgaoattggc^ 10500 
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acaccaacat ttaccgacac caacatttac caacactgaa atttaccgac accgacattt 10560 
accgacaccg tttaccaaca ccgacgttta ccgacaccga catttaccga cactgatatt 10620 
taccaacact gacatctact gacgctggca tctactgaca ccgatgccag catctaccaa 10680 
caccgacatt taccaacact gacatttacc aacactgaca tttaccgaca ttgacattta 10740 
ctgacactga catctactga cactggcatc tactgacact gacgtttacc gacactagca 10800 
tctactgaca ctgacattta ccaacaccag catctaccaa caccgacatt taccaacact 10860 
gacatttact gacactgata tctactgaca ctggcatcta ctgacaccaa catttaccaa 10920 
caccagcatc taccaacacc gacatttacc aacaccagca tttaccaaca ccgatgttta 10980 

ccaacgccgacgtttaccgacgccagcatc taccaacact gacatttacc gacaccgaca 11040 
tttaccgaca ctgacattta ctgacactga catctactga tactggcatc taccgacact 1 1 100 
gatatttacc aacgccagca tctactgaca ctgatgttta ccaacaccga catttacgag 1 1 160 
caccgacatt tactgacacc aatatttact gacatcaaca tttagccatg tgatgggggc 1 1220 
cggcttgggg gcaggccttg ctcttggcac tggggatgct gcagagacca gacagactca 11280 
tggggtcatggacttctgcttcttctccagcctcatgtactggacagactggggagagaa 11340 
ccctaaaatc gagtgtgcca acttggatgg .gcaggagcgg cgtgtgctgg tcaatgcctc 11400 
cctcgggtgg cccaacggcc tggccctgga cctgcaggag gggaagctct actggggaga 11460 
cgccaagaca gacaagatcg aggtgaggct cctgtggaca tgtttgatcc aggaggccag 11520 

gcccagccaccccctgcagccagatgtacgtattggcgaggcaccgatgggtgcctgtgc 11580 
tctgctattt ggccacatgg aatgcttgag aaaatagtta caatactttc tgacaaaaac 11640 
gccttgagag ggtagcgcta tacaacgtcc tgtggttacg taagatgtta tcattcggcc 11700 
aggtgcctgt agacacagct acttggagac tgaggtggga ggatcgctgg agtccaagag 11760 
tttgaggcca gcccgggcaa aggggacaca ggaatcctct gcactgcttt tgccacttac 11820 
tgtgagattt aaattatttc acaatacaaa attaagacaa aaagttaatc acatatccac 1 1880 
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tgccctgctt aagacagaaa acatgggtgt tgttgaagcc agaggcagct gctggcctga 11940 
gtttggtgat tggttcctaa gcagttgaag gcagttttgt ttttccatag atgtctgttc 12000 
tccctttgct gggtgcagcc tcgccctgct gctgtggtcg ggtttcagtg gcctcgtccc 12060 
gtggacgcag cctcgccctg ccgctgtggt cgggtttcag tggcctcgtc ccgtggacgc 12120 
agcctcgccc tgccgctgtg gtcgggtttc agtggcctcg tcccgtggac gcagcctcgc 12180 
cctgccgctg tggtcgggtt tcagtggcct cgtcccgtgg acgcagcctc gccctgccgc 12240 
tgtggtcggg tttcagtggc ctcgtcctgt ggacgcagcc tcgccctgcc gctgtggtcg 12300 
ggtttcagtg gcctcgtccc atgggcgtgc tttggcagct ttttgctcac ctgtggagcc 12360 
tctcttgagc ttttttgttt gttgtttgtt tttgtttgat tttgtttgat tgtttgtttt 12420 
tgttgtcgtt gttgttgccc aggctggagt gcagtggcgc gatctcagct cactgaaacc 12480 
tctgcctcct tgggttcatg ccattctcct gcctcagcct cccacatagc tgggattaca 12540 
agtgcccgcc accacgcctg gctaaatttt gtatttttag tagacagggg gtttcaccat 12600 
gttggtcagg ctggtctgga actcctggtc tcacatgatc cacctgcctc ggcctcccaa 12660 
agtgttggga ttacaggcgt gagccaccgc gcccagccct ctgttgagca tattttgagg 12720 
ttctcttggt gccagtgata tgtacatgtg tccccatcgc accatcgtca cccattgagg 12780 
tgacattggt gcctctcctc ggggtggatg cctccctctg tttccagcaa cttctgaagg 12840 
attttcctga gctgcatcag tccttgttga cgtcaccatc ggggtcacct ttgctctcct 12900 
cagggctccc aggggaggcc cgaatcaggc agcttgcagg gcagggcagg atggagaaca 12960 ' 
cgagtgtgtg tctgtgttgc aggatttcag accctgcttc tgagcgggag gagtttcagc 13020 
accttcaggg tggggaaccc agggatgggg gaggctgagt ggacgccctt cccacgaaaa 13080 
ccctaggagc tgcaggtgtg gccatttcct gctggagctc cttgtaaatg ttttgttttt 13140 
ggcaaggccc atgtttgcgg gccgctgagg atgatttgcc ttcacgcatc cccgctaccc 13200 
gtgggagcag gtcagggact cgcgtgtctg tggcacacca ggcctgtgac aggcgttgtt 13260 
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ccatgtactg tctcagcagt ggttttcttg agacagggtc tcgctcgctc acccaggcga 13320 
gagtgcagtg gcgcaatcac ggctcgctgt agcctcaatc tccctgggct caggtgatcc 13380 
tcctgcctca ccctctgagt agctgggact acagacacat accaccacac ccagctagtt 13440 
tttgtgtatt ttttgtgggg ggagatgggg tttcgctgtg gtgcccaagc tgatctcaaa 13500 
ctcctgaggc acaagcgatc cacctgcctc ggcctcccaa agtgctggga tgacaggcat 13560 
cagccgtcac acgcagctca atgattttat tgtggtaaaa taaacatagc acaaaattga 13620 
tgattttaac cattttaaag tgaacagttc aggctgggcg tggtggctta tgcttgtaat 13680 
cccagtactt tgagaggctg aggtgggcag atcacctgag gtcaggagtt tgagaccagc 13740 
ctggccaaca tgatgaaatc cagtctctac taaaaataca aaaattagcc gggcatggtg 13800 
gcaggtgcct gtaatcccag ctactcggga ggctgaggca ggagaatcgc ttgagcccgg 13860 
gaggtggagg ttgcagtgat ctgagatcat gccactgcac tccaatctgt gtgacagagc 13920 
aagactctgt cttgaaaaat aaataaataa aaaaaatttt aaaaagtgaa caattcaggg 13980 
catttagtat gaggacaatg tggtgcaggt atctctgcta ctatctactt ctagaacact 14040 
ttcttctgcc ctgaaggaaa ccccatgccc accggcactc acgcccattc tcccctctct 14100 
cccagcctct gtcaaccact aatctacttt ctgtctctgg gggttcactt cttctggacg 14160 
ttttgtgtga ctggaatcct gcaatatgtg gtccctgcgt gtggcttctt tccatagcat 14220 
tgtgttttcc agattcaccc acacattgtc gcacgttatc agaatctcat tcctgactgg 14280 
gtgcagtggg ttaggcctgt aatcctaaca ttotgggagg ccaaggcggg acgatcactt 14340 
gaggcaggag tttgagacca gcctggccag cctagcaaga ccccagctac caaaaaattt 14400 
taaaagttaa ctgaacgtgg tggtggtggg cacttgtggt tcccagctac ctgggaggct 14460 
gaggttggag gatcgcttaa gcccaggagg tcaaggctgc agtgagctat gatcgcacca 14520 
ctgcactcca gcctggacaa cagagcaaga ccctgtctga aaaaaaaaac aaaaaaaaaa 14580 
gttcctttct ttttgtggct ggatgacatc ccattgtatg gccacagcac attttgtttg 14640 
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,o.gtt.a.cgEgtggtgggcag«g«ccacc««gtc.c«g.gaaUa«c.gct 14700 
gtgaacamgaatecaagttmgtttgaacacotgttgtgaattatllggatatatgt 14760 

gtaggggtag gattgagag tccBtggta a^taggtt .gac«ac« aggaaccatt 14820 
aaactg«t.caacagtggc.gcgccgttotgcatccccaccggcag.g.g«agggttc 14880 

-tgacmacc tcctcacaaa egcttcmt ccatttaaaa aaatattcag ccaggtgcto 14940 

tggc.cacgcctg».cccagcacmgggaggocgtggcgggcgga.cacctgaggtc 150O0 
aggagttogagaogagcctggccaaca«g.gmcccca«cUccaaaaa«maa 15060 

attagccggg tgtggcagcg ggcgcOgta aKcoagcU cttgggaggc Igaggcagga 15120 
gaatcacttg aacccgggag gcagaggttg cagtgagcca aga^gcgcc acucaco. 15180 
agcc.ggg.ga-agagtgaaactccatctaaaa.aaaacaaaaataaaaa,aaamaa 15240 
ataattaaaacattcatcacagccagcctagtgggtgtcccatgtggctttgcacgca 15300 
mccctgataactaggatgc^agcgtCStcccaggcttgccacacctcagcacm 15360 
gagatacgtcgcacagtccccatttgcgaacgagaaatgaggmagggaacagcagcg 15420 
.g.catgtcaca«gcgagcaggggg.ctc.gagccg«gaccccacagccgaccaagc 15480 
ttcaatecttaccgcctcctagtgttgtggatgtagcccagggtgctcccacamttca 15540 
gatgagaacaccgaagcKaaaacaggagcgtmgtccacat«ga.acacgaWc.g 15600 
tggmggKagaagtcactttatatctcagtggtccagactggagaggacagggggt 15660 
«.«gggaatggggaaggtg«cagg«aaaggaaggaattccagattc.cca>ac«t 15720 

cottgggaag ttagaagact cagagggtc, ggcaaagtca gacaaagcaa gagaaatgca 15780 
gtoaggagga agcggagctg ttcaggaaca ggggggtcgc aggagctcac ccccaggaac 15840 
\a«cttg«ggggccttcgtgtcacaatgacgtgagcac.gcg.g«gattacccactt 15900 
Mmmtmgaggtggagtotcgoctcttgcccagtctggagtgcagtggcacga 15960 
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gcgtagctgg gactacaggc gcctgccacc gcgcccggct aatttttgta tttttagtag 16080 
agatgggatt tcactacatt agccaggatg gtctcgatct cctgacctca tgatccgccc 16140 
gtctcggcct cccaaagtgc tgggattaca ggcgtgagcc accgcgcccg gcccgatttc 16200 
ccactttaag aatctgtctg tacatcctca aagccctata cacagtgctg ggttgctata - 16260 
gggaatatga ggcttacagg ccatggtgct ggacacacag aagggacgga ggtcaggagg 16320 
tagaagggcg gagagaggga acaggcggag gtcacatcct tggctttcaa aatgggccag 16380 
ggagagacac cctctgagca tggtaggaca ggaaagcaag attggaacac attgagagca 16440 
accgaggtgg ctgggcgtgg tggcttacgc ctgtaatccc aacactttgg aaagctgagg 16500 
tgggtggatt gcttgaggcc aggagttcaa gaccagcctg gccaacatgg tgagaccccg 16560 
tctctactaaatatacaaaaattagccaggcgtgatggtgcatacctgtaatcccagctg 16620 
cttgggaggc tgaggcagga gaattgctta aacctgggag gcggaggttg cagtgagccg • 16680 

agatcccgccactgcactccagcctgggccacagagtgagactccatctcaaaaaaaaaa 16740 
aaaaaaaaga taaaaagacc aaccgaggaa ttgaagtggg ggggcgtcac agtagcagaa 16800 
gggggatcgt ggagcaggcc accctgtggt catgcactgg aagctcatta cctgacgatt 16860 
tggagctcat cactgggggc ctaaggagaa tagatactga aggatgagga gtgatggcgc 16920 
ggggcacggg tgtctttggt ggccagaact tggggactgc tggggtgcct cactgcaggc 16980 
cttctcagcg ccctttatat gcttacacag gctgtttcta agagggggat acattgcata 17040 
agcgttttcagactacctcatcatgggtccctttctttaccctc^^^ 17100 
cactctctgg gaaggtgcag gtggatgccc agacccgccc tgccatccac ctgcacgtcc 17160 
agagctgact tagcctcgag attgctgctg gcacctcctg ccccgggaca cctcggatgt 17220 
gcccgtggag atgctggctc tgtgttttct gctggagttt ggtgcgtctt ttcctcctgc 17280 
aagtggccac cgctcttggg tatgtcctca ggcttctgcg agtcatggct gcttctcagg 17340 
tcctigcccagcgcxaggagcaaaccxtcctggcactttgttcaggggtggatgc^^^ 17400 
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tgttcctgct gtggaccgcc atctcacatg agggtcttgg gcctgcaggc tcgttcagga 17460 
aacacccgct gagtatgcag tgtgtgccag ctgtgtccca ggcaatggcg gggacagtgg 17520 
ctgctgctgg ggttgtggtg gcttctgggg actctgggga cagctgaggt gcaaggagcc 17580 
acggctcctt gaggatgcag ttggactcca ggtggaaggg atggttgggg gaggtataaa 17640 
tggggtcagg gaggagacac atttggaaca atgggaacat ttttaagatg ctatgtcggg 17700 
aggcaacaag gtggccaacc caggtgctga ggagcccaca ccagccctgg acgtgttttg 17760 
ccgctcacct ttgctgggga gtggtgggag agaggattcc gttccacgtg gtggtgtgcg 17820 
cagctgggct gtgtggagct gggcgctagg aggaaggtgc tttctgcggg gctagccggg 17880 
ctctgccttt gaacacaatc aggctccagg ttttcagcat ccagtgcatg agaggacttc 17940 
acgggcagct gtggctgatc ccttgatgaa ttgggagaag aacaaaggtc tatgaaatga 18000 
ggtttcatgt agatggcatt agagacgccc acaacagatt tacagagtgg agcggagacg 18060 
gcggatgggt ctgggaggcc cctcctgctg gccttgactg tgacagctgt cctgggaatc 18120 
agcttccagg ccgccccagc agcctgactg acacacacag gggttttagc cccatcctgc 18180 
gaccagctgt tgccatcatc agtgacagct gggagtggcg gtggttccag ccctgggcac 18240 
cctccccacc tgctggggcc cacccagggc agtcctgaca cctacaggtt gcttggagcc 18300 
gcatccgagt cctgccccac cacgtgtgaa gcccgagtgg tcgtgggctg aggtcccctg 18360 
attgcatccc cacttccctt ctgcttcaca tagctgcctc ttctcaccgt ttttccagcc 18420 
tcctgggcta ggaattccag tgttgtgctg gctttgcccc aggacacctc cttagccctc 18480 
ttcctgagtc tagagccccg ggggttggaa gtcctggccc ctgggacacc tgcagccaca 18540 
ctcagcttct cctgtgagcc tccagcatgt cccctcagga ccaagccctc acgttcttgc 18600 
ctccccgccc acctgggctc agccagggga aggcctggct gggagcgtct cccctctgcc 18660 
ctgcccttct cccctcctac cctgcccttc tctcctctgc cccgccatgg cttttatatc 18720 
ctgtgccaca agacatggct gtgtgtgaaa gtggcagggt ctggcatctc tgtgggtctc 18780 
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tgaggcccac gctccagtgc cactcttccc acccgctggc cgtgcxctca tgctggaggg 18840 
acagcccagc cctctcccga accccagccc catgtgccca gctgcccccg gcxxtctccc 18900 
ctggaagccg gggtcactcc agccgtatgc catggtgggg acatcctgct tccttggcct 18960 
tccagggaag gtcctctttc caaatggcga cacctggtcc ctgcctggag gctggaagct 19020 
gtggcccttg tatgcccctc cagggtctgt gcgctcggtt ggcccgagtt cccatcaccg 19080 
tcatcatcac catcatcatt gtcatttcgc ttgtctgtga gccggcctgg tctcccagag 19140 
cagagaccct ctgaggtcca gcctgagttg gggtctccgt gctgacccct gacggggact 19200 
caggacgtac caggtctggg tcaggagtga cccccaaacc tcgtgccctt tgacaggcac 19260 
ccctgacttt tgctaagtgg gtggaggtga catcacttac agcgggagtg atgggacagg 19320 
gtctgttggc tgcactgtgc tcccagggat ctggggagag gctatatccc tgggctttgg 19380 
cactgcagag ctgtgtgtgt ttgtgtgtgt gtgtgtgtgt gtgtgtgtgt gtgtgtgtgt 19440 
gtgtgtgttt gcgtgcgcgc acatgtgtat aagatctttt tttattacat gaagcaagat 19500 
aactgttgct gtttcctttt gggttttgtg ttcaacagag tggggtactt cttccctcag 19560 
acaacagaac tctccccttt aaacacgtgc tgtcagaggg tgggtcttgg gctcatgtct 19620 
gtttgcacag ccgagtcaga ggaaacacag ggttcttcat aaaaacactg cacagcaggc 19680 
gactgtccag agtcagcctg caggacggca gcagccctgc ccctcagagc acagctaggg 19740 
tgggctgctt tgggatctcc cgtcattccc tcccagctgg cagccggcgg ccggcccatt 19800 
ccttggtgtg ctggtcaggg gggcgtgcgc ctgctctgct caccctggga atgggacaga 19860 
agctggcagc tcggagagga cagggctgga cccttgggtg gcctctggct ggaccatctc 19920 
attgtcctca gacacagcct ctcgggtota gtttcatttc ctgaaaaaca agtgcacaga 19980 
actagagcag gagtcgagag ctacggcccc cgggccagat ccagccctgc cacctgtttt 20040 
cacaccatgc tcaagctgag tgggttttac attttttaat tacttgaaaa aaaaaaagcc 20100 
aaaggaggtt tcatgaccca tgaaaattat atggaattca aaaRaaaaaa attatatgga 20160 
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^^cgtstccat^taamcttgagacgggtccgctcg^acccagg 20220 
c.ggag.gcag«c.atggca«gc.cgctguccottg.cc.^ggc.caagcga.c 20280 

ccctgunc agcccctga g.agc.ggga ctacgggtg. gtgccacca. gccoggcua 20340 
«ttt«aa«ttag«aagacaggg.cmtatg«gc^ggcu«ctggaact 20400 
eca^ttggc ctcccaaag. gaggga«a .aggctcgag ccacggagcc oag^gBt 20460 
ttgMtttc actgauaag ttttgccggg tgtggtagtg tgtgcctca gcgamggg 20520 
aggCgaggt gggaggatcg c«aagccca ggag«gag gctgggCca ag«a«agg 20580 
,gg.gaac.a.ga.ca.g«attgcat«agcctgggtgacagagcaagaacctatctc 20640 

,UaaaatatatamaaaaagUt.gggtg«gtggctcacgcc,g.ggKCcagc«c 20700 

..ggcatct gaggtgggag ga«gcttga gccca^agt ttgaggttgc agcgagccaa, 20760 
ga.cg.g.cac.acac»ugcc.gggtgacagagpccagacccgcc.c«aaaaa3a 20820 
,.^aaaaaaca.g.a«ggaacacagcca.g«g«cagtcacg.gc.ccca.gc 20880 

.gcmctgc tccagagacc c«atggcc. gaaagctgaa aamtmct atccmaca 20940 
aaaaagmg ctgacctctg «^gaaaa ttca«^ aagttctctt ccggcactgg 21000 

ec«ccaggag.=caggacaggcacg.^tct^.c«g.ccc«gacgcccagaggctt 21120 
ggcmctc aggcattctt ggaaa,a« ggCccagga aagg«gagg cc^ctgag. 21180 
cggcccagag ggaacctgcc ccagg^tgg gggaggccg acccagcaga gtgg««g 21240 
ecgatggg^ gggccggtca agatg.go.g aaagfg^c .cagaaggcc acmgggat 21300 
,^«:cag«t.agagcaactgagagctgc«at.goaagcc.gatgt«cccag 21360 
aggccgggtccaccgggtgoccgggattotggga^ggg^gaaagtagggggcag 21420 
ggggag.g.cc.gggttCggaa^ggtggc-PSP8a88«cagggagtggcttc 21480 

tgagccacca .aggggt..c tgtgggaggc tcgcocatc caggaga«c cgcagg^t 21540 
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gccggcccag agccagcgtc ttgcgcttgc cgaggctaca gccagcccca gccgggtgga 21600 
acagcccgtc gcctcctctc actttgtttt ggggccacct gggagtgtgg agcaagggta 21660 
gagagggagg aagtggctgc cggccgctgc ccagcaccct tgtttgcctt gggccctctg 21720 
tgggctcctt tttattgctc ttcaatgaag ccagggaaat ggacttcctt gcctcacttc 21780 
agttcaacat gtctggaagt ttggtattaa aattaagaaa gtgtggaaat agagcaagaa 21840 
gagaaaaatc tctccaagag ataatagtga cctctgagct gggcgcggtg gctcacgcct 21900 
gtaaatccca gtactttggg aggctgaggc gggcagatca cctgaggtcg ggagtttgtg 21960 
accggcctga ccaagatgga gaaaccccgt ctctactaaa aataaataaa taaataaata 22020 
aataaataca aaattagcca ggcatggtgg cgcctgccta taatcccagc taaggcagga 22080 
gaatcgcttg aacctgggag gcaaaggttg cagtgagcca agatcacgcc attgcactct 22140 

agtctgggcaacaagagtgaaactccgtctcaaaaaaaataaataaataaaaaataaaaa 22200 
tagtgacctc tggccaggtg tggcagctca tacccgtaat cccagcactt tggaaggaag 22260 
gccgagatgg gcagattgct ttagcacagg agtttgagac cagcctggcc aacatggtgg 22320 

aaccccatctctacaaaaatagaataaaatttaagaggtaatagtgaccttttggtagat 22380 
cgaaacctgg attgctttct ttttctaaat gctgattctt ttctttgtgg tgtttgtgtt 22440 

ctgtgccgatgtccx^toccccagccctgttattgtgagtggaagaaggggaaagggttcg 22500 
cccgctactg tgagcccctc ctctcacgct gggtgtcctt ggagaagcct gcacttcttc 22560 
attgtacgcc agggctgggt ccctccctgg agtggttctg tgctgctggg atggggccaa 22620 
cccctcagatgttttctgagtgtcacaeacaggtgtgtgcattcatggcctttgcgtgtc 22680 
ttcctgttgtggaggcaaaaatgtgaagaaccctagatgattttgggaccagggctocat 22740 
cacctgctgt tcattgcaca ccggagcatc caggcatggg tggagagctc agacttccag 22800 
gcacggtcgc aggggctggt ctaaccatgt tcccgcccgc ctgctcgtca gaaccgcctg 22860 
ttgggagctg ttatcatgat accatacctg ggccctgggc tatccgattc tgacttaatt 22920 
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gcocaggtt ggggecaggo cgttgmgc .gtmgttg « gacg«agcc 22980 
actgggcm. .c^agcccc tcagnacag gtggagaaac tgagacccat gggggtgcaa 23040 
ggacttgccgaggacccagagccccttgggggcagagc^aggcggggccttgcmggg 23100 
,cccagagc.tccagtccccttcccgctetccmcagc.ttmt«gagacaaga.c 23160 
.caccctgtcacccaggctggag.gcaa.ggca,gatc,oggc.caCgcaa.c«cgct 23220 

agcgcgBc cagcgattC ^cC-g cc^ccgagc agcgggatt acaggtgtg. 23280 
gccgcca«cccagctcgttmmtg«cttttag.agaga,agggttteacca.gtt 23340 
ggccaggctg atcogaao. cotgacctca aatgatccgc c«cc.cggc ccccaaag, 23400 
gcaggaaa caggcggga .cacactg« cCggccCa gcagcmg. c«gtgcoat 23460 
ccaacaacag a«accgaag ««g«c ttaacatgca cccatctgc cucagm 23520 
.gccacagcaaaacagaggacttg.cgcmaggtaag«ggaaa.g«a«gi.a 23580 
gcaggaggcc..g«gaagctgcc«aatggc«g.g. create cgtcctgag 23640 
agccggagaa c«gga.g« gcac«aa« caacc«cct g«aaca« .gttc«cag 23700 
gccatggat catcagaacc acgtcctatc .cacgcgga gmtgcttcc gttggttcag 23760 
g.gm.uccttgacag.attt«cctcgg«g«Mgcgg.ggttgc«ttaa«=a 23820 
gca«gac.cOcaagaaaaa.att.agctgc.aca«cagaggagacaggg.ggaaag 23880 
ca.agagacagcaggc.cagac«agaaccagaag.gc^cagagttcatccggccc 23940 

.gacccageg ggaaatgagt tcacagagaa gcgggagaac mgccccag gcccgccgt 24000 
,gct^taactgccceaggtcc«aca«gc.ccagg.cctgccccaggcc«gcagtt 24060 

goKalaaa gccccaggtc ctta««g Cccaggtcc tgc^cagg. cctgcagttg 24120 
cWgtgtgg Wgtgtgat CggagccC ccgccca«g «gcacc«g ggoaggca« 24180 
gomttga, cccaggactc ctt^tgcgg agcacgccc, ggnCccag gcagccgCg 24240 
eotgtoagcc.gcag«gttcgggagaggacacagottgcc«g.ctgttccaaaKtt 24300 
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tggagagagc agcatccagg tgcggcaggg acaggcctgg ggctcgcggg cagggactct 25740 
gtgtcctgcc ggggtcccac actgcacctg cttgtcagag gcactcagtc aatctttgct 25800 
gatgaaggat gagaggacag aggacgtgat gcttgctgct gcattgcctg cagtcctggg 25860 
tgagatgccc gggttgactc tgctgcccgt cgggtggatg tgatgtcaga tccccggctt 25920 
taaaatacga gggagctggg aattgaggga gcaggttggg gcagaaagca cagccccgtg 25980 
gaagcctgga gctgaggcag tgtgggcgac ccctggagca gtgagtgctt ccttcatggc 26040 
cttcatcgcaccctgcagtcctcatgtaggggatgccatccatgaattta'gtttt^^^^ 26100 
cctcctttaa aaacgcgttc atgctggggc cggggcagtg cagtggctca catctgaaat 26160 
cccaccactt tgggaggccg aggcgggtgg atcatgaggt caggagatcg agaccatcct 26220 
ggctaacaag gtgaaacccc gtctctacta aaaatacaaa aaattagccg ggtgcggtgg 26280 
cgggcgcctg tagtcccagc tactcgggag gctgaggcag gagaatggcg tgaacccggg 26340 
aagcggagct tgcagtgagc cgagattgcg ccactgcagt ccgcagtccg gcctgggcga 26400 

cagagcgagactccgtctcaaaaaaaaaaaaaaaagtacaaaaaaaaaaaaattagtctg 26460 
ggtgtggtat cacgcgccta taatctcact actcgagagg ctgaggcgga gaattgcttg 26520 
aacccaggag gtagaggttg tagtgagccc.gtatcgtacc actgccctcc acctgggcaa 26580 

tagagcgagactctgtctcaaaaagaaaaaaaaaaaaagaacatttatgccaggtgtggt 26640 
ggctcatgcc tgaaatccca gaactttgga agactgaggc aggaggatca cttgagccca 26700 

gaaatttgagagtgtcttccctgggcaacatagagagacctcatctctaccagaaaaaaa 26760 
aaaattagcc cggcatggtg gcatatccct gtggtcccag ctacttaggg ggctgacgtg 26820 
gcaggatcac ctgagtctgg aggcagaggt tgaagtgagc tgagatcatg ccactgcact 26880 

ccagcctgggtgacagacagagaccctgtctcaaaaaaaaaaaaaaaaaaaagcatttac 26940 
tatccaccat ggaaggtgag actgacctgt gagtgattgt tcaaagaaca aaaaataaac 27000 

cccagagataagacaaaagggtgcctccatgggggtgtgatttaaagctg agaaattggg 27060 
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cttcttcccc ctcccctctc accccgtggt ttgctaaagg agatgggaaa aaggattctt 27120 
tttttggctg aaatatttaa cactaaatta aagccaattt taacagcact ttggttgatg ■ 27180 
agtgaaatta acagactggc caaaaataaa cgaacggtct gtactatgtg aaaaagaggc 27240 
agctttggcc atgctgggcc aatgtgagtt ttcagggttg ctgggaatgt ctgtgaatcg 27300 
gaggaaggge ctagctggga ctctcaggag ccaaggccct gaggggcaac ttgcctggtc 27360 
cctgccctga ggcgttcact gctttcttcc tgggccagat cacaggcccg gaggctggac 27420 
cactgggctg gcactcttgc cgagctgctc cctgacttcc tgaccatgct cctttcagca 27480 
gccttgctgc actttagttt ccttgaatga aaaatgggga tgagaatagc tcctacctcc 27540 
aaggtgaatg gagtgagttc ggacaggtga ctccctggga ccagtgcctg gcgcctgaca 27600 
aggtccagtc agagcccgca ctgctgttac tgataccctt ggctgtacca ggggagaact 27660 
tggttgccat tgccaggtgt tctcccacca cccccactac tgtccctgtt tgatgtgtgg 27720 
cgggaataaa gctgtgcaca ttggagcttt tggcacatcc tggctttcag gtgaaaggtg 27780 
cgtgtgtgtt tgagggttta gcctggccaa cccagccatg aggtcggacc tgacctgggg 27840 
gtgagtcctg agctcggcac ccctgagctg tgtggctcac ggcagcattc attgtgtggc 27900 
ttgggccgca cccctttccc tgctgggctg ttgatgttta gactggagcc tctgtgttcg 27960 
cttccaggaa ccaacccgtg tgcggacagg aacggggggt gcagccacct gtgcttctgc 28020 
acaccccacg caacccggtg tggctgcccc atcggcctgg agctgctgag tgacatgaag 28080 
acctgcatcg tgcctgaggc cttcttggtc ttcaccagca gagccgccat ccacaggatc 28140 
tccctcgaga ccaataacaa cgacgtggcc atcccgctca cgggcgtcaa ggaggcctca 28200 
gccctggact ttgatgtgtc caacaaccac atctactgga cagacgtcag cctgaaggta 28260 
gcgtgggcca gaacgtgcac acaggcagcc tttatgggaa aaccttgcct ctgttcctgc 28320 
ctcaaaggct tcagacactt ttcttaaagc actatcgtat ttattgtaac gcagttcaag 28380 
ctaatcaaat atgagcaagc ctatttaaaa aaaaaaaaga tgattataat gagcaagtcc 28440 
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agcctattct actgtttgta ttacatagct ttaaaagatt ttttatgact ttaagtcaca 29880 
agggttcttt gtagaaaaaa atatatatat aggaaagtat aaaaagaaag taaaaattgt 29940 
ccataacctc tccagccaga gacgaccgtt gctgacacct cagcatattg cctttaagtc 30000 
ttttttctct aagatagcat ttctcttcat cacagtcata tgctacgcag aattctgtat 30060 
cctgattttt tcacttgaca ttacaacagg tatttgatgg cgctgtgaca aactctttgg 30120 
cacaatcttt taaatgtatg aaatactcca ctgcacagat gtttgctttt aggcttaact 30180 
gttcttttat tttgcgtgtg ctggttacag ccgggcacag tggctcatgc ctgtaatcac 30240 
aacactttga gagggtgagg caggaggatc acttgagccc agaagtttga gaccggcctg 30300 
ggcaacatag tgagacccca tctctacaaa aaactttttt aataagtcgg gcgtagtggt 30360 
gcatagctgt agtcccagcc accaaggagg ctgagttggg aggattgctt gagccccagg 30420 
aggttgatgc tgcagtgacc tgagattact ccactgtact ccaacctgag cgacagagca 30480 
agacttgtct ggggaaaaaa aaaaaaaaaa tatatatata tatatatata tatatacata 30540 
tatacataca cgcacacaca cataatataa aaatatatat ttataaatat ataatatata 30600 
atataaaaat atatatttat aaataaaatt tataaattat atttataagt aaatatataa 30660 
tatataatat aaaaatatat attatataat atataataaa atatataata taaaaatata 30720 
tatttataaa taatatataa tacatactta taagtatata tttaaaatat atgtaatgta 30780 
tattttttaa tgtatgatat ataatataca tttataaata cacatttata ttattttata 30840 
taaaatatat ataaaatctc caagttgctt tttccaaaaa ggtgtcttgc tgcatttcaa 30900 
acattcattt aaaaacttga atgctggtga tctggtccag aatgtgttca gtagctgctg 30960 
ccagtggcca agcatctcgg gagatgtcta caaaacacgc tggttctggc ctggcgtggt 3 1020 
ggctcacgcc tgtaatctca gcactttggg aggctgaggc aggtggatca actgaggtct 31080 
ggatttcgag accagccttg ccagcttggt gaaaccccat ctctactaat aatacaaaaa 3 1 140 
aattagccag gcgtggtggc atgtgcctgt aatcccacct acttgggagg ctaaggctgg 31200 
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ggtagacaca cataagggct tttgtgaaat gcttgtgtga atgtgaaata tttgttgtcc 28500 
gttgagcttg acttcagaca ccccacccac tcccttgtcg gtgcccgttt gctcagcaga 28560 
ctctttcttc atttatagtg caaatgtaaa calccaggac aaatacagga agactttttt 28620 
tttttttttt tgagacagag tcttactctg ttgcccaggc tggagtaccg tagcgtgagc 28680 
tc^ctcactgcaacctccgcctcccaggttcaagcgattcttc^^^^^ 28740 

gtagctggga ctacagacat gcaccaccac acccagctaa ttttttttat atttttagta 28800 
gagacagggtttcatcatgttggccaggctggtcttgaactcctgacctcaggggaacag 28860 

acggggttgg cctcccaaag ggcggaaata acaggggtga gccaccgttc ccggcctagg 28920 
aaaactttttgccttctaaagaagagtttagcaaactagtctgtgggctggccttctgat 28980 

tctgtaaaga aagtttgatt ggtggctggg tgcggtggct cacacctgta atcccagcac 29040 
tttgggaggc cgaggtgggc agatcacctg aggtcgggag ttcgagacca gcctcaccaa 29100 
cgtggagaaa ccccgtctct actaaaaata caaaaaaaaa attaaccggg catggcggcg 29160 
cctgcctgta atcgcagcta ctcaggaggc tgaagcagga gaattgcttg aacctgggag 29220 
gcggaggttg tggtgagctg agatggcacc attgcactcc agcctgggca acaaaagtga 29280 
aactccgtctcagaaaaaaaaaagtttgattggtgtaacxaaagcgcatttgtttatgga 29340 

ttgtctgtgg cagcttttgt tctgccgaga tgagttgtga cagatctgta tgggctctaa 29400 
agcctaaaac atgtgccatc cgccccttta cagaaaaagt gtgctgacct ctgttctaaa 29460 
gtattggacaactacaatgtttgctcatttattattctatgatttgttttctgctttttg 29520 
ttgttgttgt tgttgttgag atagggtttc cctctgtcac tcaggctgga gtgcagtggt 29580 
gtaatttcag ctcactgcag cctcgacctc ctgggctcta gtgatcctct catctcagcc 29640 
tcxctagtagctgggactacaggcacacaccaccactcctggctgatmtt^ 29700 
tttttttttt gtggagacag ggtttccgca tgttgcccag gctggtttca aactcctagg 29760 
otcaaacacc cacctcagcc tcccaaagtg ctgggattac aggcgtgagc caccatgccc 29820 
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agaatcgctt gaacccaggg ggcagaggtt gcagtgagcc gagatcgcac cattgcactc 31260 
caggctgggc aagaagagcg aaactccgtc tcaaaaaaaa aaaaaaagat gctggttcct 3 1320 
aaaatgtggc ccttttcctc ctcacctgct gccagaccat cagccgcgcc ttcatgaacg 31380 
ggagctcggt ggagcacgtg gtggagtttg gccttgacta ccccgagggc atggccgttg 31440 
actggatggg caagaacctc tactgggccg acactgggac caacagaatc gaagtggcgc 3 1500 
ggctggacgg gcagttccgg caagtcctcg tgtggaggga cttggacaac ccgaggtcgc 31560 
tggccctgga tcccaccaag gggtaagtgt ttgcctgtcc cgtgcgtcct tgtgttcacc 31620 
tcgtatgaga cagtgcgggg gtgccaactg ggcaaggtgg caggctgtcc gtgtggccct 31680 
cagtgattag agctgtactg atgtcattag ccttgatggt ggccaggact ggtagggccc 31740 
tcagaggtca tggagttcct tcgtggagcg ggtgctgagg ctgtatcagg cacagtgctg 31800 
gctgctttca cctgggccgt ctcaccgaag tgtccatgga gcctgcgtag ggtgggtatc 31860 
tgtgtcgatt ttacagatgc agaaacaggc tcagagaaac cgagtgactt ccctaaggtc 31920 
acatacccag ttagagcaga gctgggccag gaagtgctgt ctcaggctcc tgaccaggtc 31980 
tccttgcttt gcactcttgc caaaaccatg atccagaact gactttgagg tccccggacc 32040 
tcaggctcct ccgaaatggc ctcttggagg ctgctgagcc acagcttagg acccacctcg 32100 
agaggcaaat gtgctttgag ctgccaggcg tcctgggggc cctgccttgg gcacggggtt 32160 
cagacaggcc ccagatgtgt ggggcgtctt tctggacttg agttttcttt tctgtgtggt 32220 
ggacacagtg ctcacccctt aaagcacctg tgatgtgtgc agcagcccaa tccctgcctg 32280 
tcgcctgttc tgctagggaa ggaaggaata cttcaggatg gcaggacaac agaaagaggt 32340 

ccaggttttagagcaagggcaggtcaaacttagaaaattctggaatgaggatgtgcattt 32400 
cctcttctgg atctgctaaa agaagaggga aggaggggct gctgggggag gagcccagag 32460 

ccgagtttacatccggatcccgcaaggcctcccctgccctgaggtcttgttttgtgatgt 32520 
gcttgtgtcc atcctggttt ctgccgtgtc cccaacatcc ggccaagctt aggtggatgt 32580 
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agtcccagct acttgggagg ctgaggcagg agaatggcgt gaacccggga ggcggagctt 42300 
gcggtgagcc gagatcgctt cactgcactc gagcctgggc aacagagcaa gactccgtct 42360 
cacgcaaaac tctgtctcac gcaagactcc gtctcaaaaa aaaaaagagt tcagggttta 42420 
tgaaactggc cagccgcgta aagtttgctg tgttgttttt gtgcccggga ggagtgtggc 42480 
cagggtgtca cgtcacacag tacacgtttc tcagatggtg gttctccaga ctgctgtccc 42540 
aaagtctgtt tttgcatctg gttcccacag acccaccctc cacggtgagc ctgattttgg 42600 
ccagggtagc tggaatcttg cttgtctttc agcccggcag ctgtaccagt ccagggtcca 42660 
cagctagtgg cttttaggaa ggaatttgtt cagttggctt tgacacatgg ccccctaggg 42720 
tccacagctc tgtagtgatg tggatgttgt tatctacaaa gacacatgat ccttcgtgtc 42780 
cagatgaaag tgatgatgtc tttgcagctg cccagcaagg ctgtgtgtgt gtgtgtgtgt 42840 
gtgtgtgtgt gtgtgtgtgg tgtgtgtgtg gtgtgtgtgt gtgtatgggg gagggaggca 42900 
ccctttccat ctgggggtgt gtgtgtgtgg ggtgtgtgtg tgtgtgtgcg cgtgtgtgtg 42960 
gtgtgtggtg tgtgtgtgtg tatgggggag gcaccctttc catctgggtc caagagactg 43020 
ggcctgggga agacgcttct ttttatctac ttagagactt tgttttattt gtattttttt 43080 
gagacagggt ctcactctgt cacccaggct ggggtatggt gatatgagca tagctcactg 43 140 
cagcctcggc ctcccaggct gaagcgatcc tcccacctca gccttctgaa tagctgggac 43200 
tgtaggcgtg cgtcaccata ctgagctatt gttttttttg tttggttggt ttaatttttt 43260 
ttgatacaga tggagtcttg ctatgttgcc cagactagtc tcaaactcct gaactcaagt 43320 
gattctccca cctcagtttc ccgacattct gggatcacag gtgtgagcca ctgctgtctc 43380 
. cctgttttat taactgctga aagacctaga taaagaaagt ctgaaaagac ttactatcag 43440 
agcaccatcc taagatgatt ccctctgact caatggagag ggaggggagc ttttccttca 43500 
ggcctgggtg gcaggagccc aggtgctcca ggccccattt gccccaggcc aaatcactcg 43560 
ggaacttgga tgcagctgtc tttcagggta acccaaagga acpagatccc cgcaggcagt 43620 
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aggcttctgg gctgtcctct cctcctacgt cagctcagta agagcccttc gaagggatgc 43680 
tgtgtcggag gccccaaaag cccaggctca tccctgagat gcacagggtg ggctgggctt 43740 
aggcagcgct cgagcatctc ctggacggtg accccagaga gtgtggagac ggagagtcct 43800 
tgagagtcac tgagagacgt ggctgccctg ccttcccaag aggggctctg agtcattccc 43860 
cacactcacc tgcccctacc caccctcacc tggcccccag cctcacctac ccccacatct 43920 
gtaccgatcc ctttacccgc accttcccta cccaccctca cctcccctgt accttcacct 43980 
cccccactca cccgcccctg caccctcacc tgtcccccac cttcacctaa cccccaccct 44040 

cacctgccctcccctcacctggcctccttccgttggggaaggggttgtaaggggcggccc 44100 
ccaaactgtc tgtcctggtg ccctgcagag aaaacagtac gtgagggccg cagtccaaaa 44160 
gcttgagtoc tggaaggtgg aggagacagg gatgtgttgg gaagggcccc atggtcttgg 44220 
atcccttctc gactgtcaat ggggccttca tgggagcgcc agtctagtga tgcacagctg 44280 
gg^cccggc gggtggctga ggaggcctaa agtccgaggc ggcaagagct cttccagagg 44340 
ctgttgtcct aatcgctctg gcatactcag gcgggcacgt agttaggagc tgattggaga 44400 
ggagagaccc ccacaccaat actgggattt gactttcagg ctaaacttga gaagtgtggc 44460 
ctctgctgtc ctgccagagc tctccagcca gtgcccaggg ctctccagcc agtgcccggg 44520 
ggtctccacc agtgcccggg ggtctccgcc agtgccaggg gtctccgcca gtgcccaggg 44580 
gtctccgcca gtgctcagga gtcttggttt ctttgtctta cagccctttg ttttgacctc 44640 
tctgagccaa ggccaaaacc cagacaggca gccccacgac ctcagcatcg acatctacag 44700 
ccggacactg ttctggacgt gcgaggccac caataccatc aacgtccaca ggctgagcgg 44760 

ggaagccatgggggtggtgctgcgtggggaccgcgacaagcccagggccatcgtcgtcaa 44820 
cgcggagcga gggtaggagg ccaacgggtg ggtgggggtg ctgcccgtcc aggcgtgccc 44880 
gccgtgtctt ctgccgaatg ccagcctctc acaggctggg gagactttcc accctgggga 44940 
tccaatgggt ggctttccag ggtcccaaaa gcaaacacag gctctttcac agcccctcca 45000 
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ggaaagcaga aagccccaag ggctggaagg gaagggggag ctctgctgag aggttacaag 45060 
gcagcgctgg ccgacgggag ttgcagttga taggttttgt atcatccttg ttaaacttga 45120 
accctgtgca gaaatccctt ccacggcatg ggggctgcct gttgactcgc tcctgttcca 45180 
ccacagggag ctcctgggct tcttcctccc agaggccccc gacgctccca cctgttggtc 45240 
gtcagagctt ctggttggtg ggaaggcacc caggaccttg aggtctccag agagaaaagc 45300 
cagggaaaga gggagaccga aacccatgtg acatgaaact caggctccaa actgagcacg 45360 
ggaacgtttg gggacaggag cgcgatggcc ttcctcagat agctgggggg ctggcatgaa 45420 
gacgggagct acagccagca caggtcctgg gccgggagec cagagattga gccctgactc 45480 
tgtcacttac tggccacgtg accttgggcg ggtggcatag cctcttggag actcagtttc 45540 
ctcattggta ggagtgacgg ccacagtggt gcggcctctg cagcacacgg ggggctcggt 45600 
gggcggaagc cccgggtcta taaggcggct gtgcaggagc cagccgagct ggtctcccaa 45660 
cagccagggc tccggggtcc ttagcagctg tggggggcct gcacctgttt cccatggctg 45720 
ctgtcagaaa ttaccagaag ccaggtggct gagagtaatg gacacttgtt ctctcacagt 45780 
tcctgagggc tgaagcccga gatcgaggtg tgggcagggc cctgcgccct ctgaaggctc 45840 
tgagggaacc tttgggcttc tggtggctcc aggcacccct tgacttgtgg tcctgtcact 45900 
ccagtctctc tgtctggctg cacatggcgt ggcctcttct gtaccattga aggacacttc 45960 
agttggattt agggcctacc ctcacccatt gtggtcgtat cttgatcctt catgacattt 46020 
gtaaagaccc tgcttccaaa taagctcaca ttctgaggtt ctggggtgag cgggaatttg 46080 
gagagcattg ttcaactagt atagaatgtg acctgtcagc ctcgggcagc cctgagaggc 46140 
aggggctttc cacagcccag ctgggtgccc tgggctccgt gctgtccgag gagacgccat 46200 
ccccacaccc gtccttcacc cgccaccctc ccgcaggtac ctgtacttca ccaacatgca 46260 
ggaccgggca gccaagatcg aacgcgcagc cctggacggc accgagcgcg aggtcctctt 46320 
caccaccggc ctcatccgcc ctgtggccct ggtggtggac aapacactgg gcaagctgtt 46380 
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^tgggtggac gcggacctga agcgcattga gagctgtgac ctgtcaggta cgcgccccgg 46440 
ggcctgccct aaccgcagac acccggcctt cattgtcagt aatggcagca gctgccacat 46500 
tgtccgagac ctgccgtgag cccagtgccg cgccaggggc tttgtgtgta gcgtgttttg 46560 
tcctcacact gacagctgta ggctggggtt ctgagtgagc cccacagggc agaggcagaa 46620 
aatgagtctc agagagggtg agcgagctgc ttggggcccc acagcaggag atggagcagg 46680 
actgcagcct agcctctgcc cccagcacct gcgcaagaag ctgctctgct ctggactgtg 46740 
ttaggctgcg agggctggag agaaatgaga gttggtgctt agagaggggg cgcaggtccc 46800 
catggctttt cctcttatga tgaggtagat gggtgaaggg aggggccatg cttgcagggg 46860 
ccagtgaccg aggcccgccg ttggaactga tggccttcat cccgagccca gcccaggtgg 46920 
gagcagggct ttccgagggc ttgtcttggg tcggcctgct tccagggact ctgctgcagc 46980 
tcccacccct gtccaaagca tggaatcccc caggctccct ggcagtcctg tcaacctctg 47040 
tcctcccaag ctgagtgtgg ggcaagttct ggaggtcagc actgctcagg ggggcccacg 47100 
ggctgcttgc aggggccaac cgcctgaccc tggaggacgc caacatcgtg cagcctctgg 47160 
gcctgaccat ccttggcaag catctctact ggatcgaccg ccagcagcag atgatcgagc 47220 
gtgtggagaa gaccacxggg gacaagcgga ctcgcatcca gggccgtgtc gcccacctca 47280 
ctggcatcca tgcagtggag gaagtcagcc tggaggagtt ctgtacgtgg gggctggcag 47340 
tggggtgggc agggtggcct ctaaacccga cccctggagg aggctggagg ccagtgcaag 47400 
atcctgtgtg gcctcagcca ggcggtggtc tctgccagat gccaactgtt gcccgctggg 47460 
gttcagcgac atgtccgaat gtcccgaggc ctctgaggtt gtmctm gccgcagaac 47520 
aaatcaccac gaacagcgtt ttaagacaac accaactctt ULlUULL ttttttttga 47580 
gtcaggatct tgctctgttg cccaggctgg ggtgccctgg tgcaaacaca gttcactgca 47640 
gcctcgacct ctgggcttaa ttaagtgaac accttgcctc agcctcccag gtagctggga 47700 
ctacaggtgg gcaccaccac acctggctaa tttttttttg tagagacggg gtttccccat 47760 
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gttgcccagg ctggtctgca actcctgggc acaagctatc tgcctgctgt ggcctcccaa 47820 
agtgctagga ttataggtgt gagccactgg cctgacaaca cccacggatt gtctctcagt 47880 
tctgtaaggc aaagtccagg cacagcgtgg ctcacctggg ttctctgctc agggtctcac 47940 
ggggccagaa tcaaggtgtc aggaacgctg ggccctcagc ggaggctctg tggagaaatt 48000 
agcttccttg ctcactcagc aggtagcagt tgtgggatcg aggttctgtt ttctctctgg 48060 
ttattggtcg gggaccactc tcagctccta gaggccaccc caggtccttg ccccgtggcc 48 120 
ctctctgcct cagcagtggg ggctccxtgc gtcagtccct cccgcacctt gagtctctct 48180 
gatttgcttc taaagggccc tgtgattcgg ctcagccacc tttagattag gttagcctcc 48240 
cctttgatag actccaagtc ggctgattaa taaccttact cacatctgca gaatcccttc 48300 
tgccacataa ggtcatgacg ccgtgctggg gactggggtg ggaaattacg gggtcattta 48360 
ggattctgcc tgccactgcc ttgctgtgtc ccagggcttg ggggaggggc ctccacagct 48420 
gggaccacag tccttcctcc cctccatggt aaccatctga ggattacttg agaccagcct 48480 
gggcaacatg gtgagaaccc atccctacaa aaaatacaaa caaaaaggga ccaggctggg 48540 
cttggtggct catgcctata atcccagcac tttgggagac caaggtgggc tgatcacttg 48600 
aggttgggag ttcgagacca gcctgcccaa catagtgaaa tcccgtctct actaaaaata 48660 
caaaaattag ctgggtgtgg tggcaggcgc ctgtattccc agctactggg gaggctgagg 48720 
tgggagaatt acttgaacct gggaggcgga agttgcagtg agccaaaatt acgccactgc 48780 
actccagcct aggcaataga gtgagactcc gtctcaaaaa aaaaaaaggg ccaggggtgg 48840 
tagtgacaaa gagaccctat cccaaaaaaa ccgaacactg aatccttgag actgagtaag 48900 
gacactgtga aatttttctg ggtggggcag ggaacagagc gtcttctgtc atttcttcca 48960 
cctgggtgtg gtcagctctc cctccaagct gcctcctctt cttctcattg tccgggtgtt 49020 
ggacacattt ggttaactgg atagaataac gcgagttccc agggacttgg tccatttgct 49080 
attttatttt atatatttt attttatttt atttatttat ttatttattt atttatljat 49140 
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tgagatggag tttcgttttt gtcgcccagg ctggagtgca gtggcgcgat ctcggttcac 49200 
tgcaacctct gcctcccagg ttcaagtgat tctcctacct cagccttcca agtaactggg 49260 
attacaggca cccaccacca taccaggcta atttttttgt atttttagta gagacgggtt 49320 
ttcgccattt tgcccaggct ggtcttcaac tcctagcctc aggtgatcca cgcacctcgg 49380 
cctcccaaag tgctgggatt acaggcatga gccaccacgc ctggcaccat ttgctatm 49440 
aattcccatg tgtattagtg tcccacggct gctgtaacaa atgaccacaa actggatggc 49500 
ttaaagcaac agaaatggat tcccccaatg tgctggagac cagaagcctg cgaccaaact 49560 
gttgggaggg ctgtgcttcc tctgggggct ccagggagga tctatttgtt ggcccttcca 49620 
gtgctgtggg tgccagcgtt ccacacttgt ggatgcgccg cctcaacctc tgcccatctt 49680 
catgtgtcca tctcctttgt gtctgcgtct ttacctcttc ttcttgtctg tgttgcctct 49740 
tataaggacg tttgtcattg ggtttagggc ccacccaaat catccgagat gacctcgtct 49800 
tgagatcctt aacctgcaaa gacccttttt ccaaaaaaag gttatgctca cagattctag 49860 
gccttaagac atgggtgtat ctttctgggg ggcactatcc aaccccttat acaatgaaag 49920 
acgggaagag ggccaggtgt ggtagttcac gcctgtaatc tcagcacttt aggaagctga 49980 
^gcgggagga tcacttgagc ccaggagttt acaagtagct aggcaacatg atgagacccc 50040 
atttctacaa aaagtaaaaa aaaaaaaaaa aaaaaaaaag ccaggtgtgg tggctcacac 50100 
ctgtaatccc agcactttgg gaggctgagg caggcagatc acgaggtcag gagattgaga 50160 
ccatcctggc taacacggtg aaacxccgtc tctactaaaa atacaaaaaa ttatggccgg 50220 
gcgcagtggc tcccgcctgt aatcccagca ctttgggagg ccgaggtggg tgaattacaa 50280 
ggtcaagaga tcgagaccat cttggctaac acggtgaaac cccatcaaga tcacaaggtc 50340 
aagagatgga gaccatcctg gctaacacgg tgaaaccccg tctctactaa aaatacaaaa 50400 
aattagccgg gcatggtagc gggcgcctgt agtcccagct gctcgggagg ctgaggcagg 50460 
agaatggcgt gaacccggga ggcggagctt gcggtgagcc gjgatcgctc catgccattg 50520 
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atgggacgtg ctgacaggtc ctctgccggg ttcctgcctt gctatgcgca cgctggtcac 54720 
cacagaggcc tggcccttct tctgtagcag tcccacaccc gcaacaggtg tggctgctga 54780 
ccacctgctt tctgcccctc tggtcctgag gagggcgcag tgggcactca ggcgtggctg 54840 
agcagatgtg tgttgccggg aggaggaagg actgctccag tcagggctga atttcccacc 54900 
cggagcattt ctgctgtatt tggtgtagcg dctgctgctt aaagctctga ttcccagttg 54960 
gcaccctttc ccttctgcat tgaaaaacat acggatgcat gtcttcttgc agtgaatgtg 55020 
tattctccca gcctctcttc tgggttgggg ctggaggtgg agcggcacac aggagccgca 55080 
gcgatggagg atgtgcgggt gcagcacccc gtacagcagg gatgccaaac ccgcgctgag 55140 
tccctctcaa cttctgcttt gaagcccagt cacgccattg cctgggtttt gctgggcggg 55200 
gctgcatgtg atgttctcct ctgtccctcc cccagagccg cccacctgct ccccggacca 55260 
gtttgcatgt gccacagggg agatcgactg tatccccggg gcctggcgct gtgacggctt 55320 
tcccgagtgc gatgaccaga gcgacgagga gggctgcccc gtgtgctccg ccgcccagtt 55380 
cccctgcgcg cggggtcagt gtgtggacct gcgcctgcgc tgcgacggcg aggcagactg 55440 
tcaggaccgc tcagacgagg tggactgtga cggtgaggcc ctccccgtca aggctctgcc 55500 
aagaccctgg ccctgccctc cgggatacga gcttggggct gcctccggcc tcacaggagt 55560 
aggggctctg aaaacctttg cttgcaggga gattgccaag tctgtctttt aggcccaaca 55620 
aggaaaactc tgcagttcca cccatcctgt cccaccaggt agtgtggctt gaaggcagac 55680 
tgtgagggtc tatctcacct tcctgcatta ggtcaggagt ttcacagaaa cctgaggcac 55740 
attcaggggt gggctgcaga ggtccatggc tcacaccctg gaaaatccgc ccccaaaaga 55800 
cagtgctgtc tccactgacc agtctgtggg atagtgctta agcctgagtg gtttctatca 55860 
acatgtagaa tcaggaggta taaagagatt tgctcaggca tcctgggccc tctctgacca 55920 
gcaggatctt cctttagatc ttgacagtga aacacatctc ttctgtgccc cctgtgagtt 55980 
ttctttcatt cattcattca ttcattcatt cattcattca ttcgagacag^ agtcttgctc 56040 
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agcagagggg acattttgtg actgtccccc tcctgagctt cccagcagct tttctccaag 53340 
ttacagccca aaagctcagg tggatttgca acccaacggt gtctgtgcac ctcccactga 53400 
tgcccgaact gccctggcca agaaacgggg ccgtcagaac gctgcactaa ctgcagcctt 53460 
gggcctccat gccagaggcc atgcccttcc atccaccacc ccctggcctg ggccctggcc 53520 
ctcctggctc gggaactcca ggccccttcc tcacggatcg agagacgtgt atttaccgca 53580 
caggtgcttg tcattctctt gtggcctctt ctccagggag atcacagaag gacagggcct 53640 
cactgaggtc tcggacatgg accctttgat agtggcagga gccaggctgg gcaagaggcg 53700 
gccacagtca cctcagcagt gccatcacca ccgccattca gcccttccx:t gagccgggcg 53760 
cgcccctggc tctggcccca gtgtcccagt tacagctcac aggagcttgt ggtgcccagc 53820 
ggctgcttct gattgagagt cgaggtcgga ggctttggga ggctgagagg ctgctcggtt 53880 
tcacaactgc tgagggagac ttgggctcca tctcaggtct gccccatgtc gccctcaacc 53940 
tccagccacc ggtcctccgt gtcccccatg gccaggcacg gcttgcagac atctgtcgtt 54000 
ggctcctctc agccgtcgtg ggctgaccct ggcacgtcct cctgtggctg agcccagtgg 54060 
ggacagctgc ttccttttat taccctagaa ctotcgtctt tgatcaggcc ccctccccta 54120 
tgccacacag tccctgtcac tcgggtgagc ccagtagtca tggggaaggc ctgcgggttc 54180 
caaacatcca aaggcttgcg tgcagcatga cagcttgaaa ccgatgtttt ttaccttgat 54240 
cagatttcag cttggcgggg gctttgctca gctttcagtg aggcctgggc cgatttccca 54300 
gcatcccctc ctgaggccag cctctgtttc ctgtgatttt ctgcacaaag tgggagggag 54360 
gagtcttagg aaatgggggg ccacctcgaa acctaggcct cctctggctt ctctgtgcca 54420 
gtgcccccac gctttgtgtc tgtgtcccca gcccatggga ctgtgttatt ccctgagtgc 54480 
tgccgcatgc ccagcccgca ctgaggacgt ggagccccga ggggcaggat ggcctccatg 54540 
gtcacacgta ggaagtggcc tccaccctcc gatgatcctc tccccccctc cctttcagcg 54600 
ccttccccgg gggtgtcatc agccctcctg cctgtgcttt gtcccgtctt ctgcaggcgc 54660 
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cactccagcc tgggtgacag agtgagactc cgtctcaaaa aaaaaaaaaa aaagaaaatt 50580 
agccaggcac agtggcaggt gcctattgtc ccagctactt gggaggctaa ggcaggagaa 50640 
tggcatgaac ccgggaggtg gagtttgcag tgagccgaga tcatgccact gcgctccagc 50700 
ctgggcgata gagcaagact ctgtctcaaa aaaaaaagcc aggcatggtg gtgcatgcct 50760 
gtagtcccag ctactcaaga ggctgaggca ggagggttgt tcgacccacg gagatcaagg 50820 
ctacagtgag ccatgatcgc accactgccc tccagcctgg gtgacagagt gtgaccctgt 50880 
ctcaaagtaa gtaaatagga ggagagacaa gtgggcagtt cagactgatg gtatgggcac 50940 
agtagagact ggtgcagaca ggctggcctg tgatgtcaag caacttctgt aactgtttcc 51000 
ggcatccatt tgtgtgtcaa tttccgtgtc agtaggaaga ctctgtaggc tgccaagagg 5 1060 
aataagtggg aggatcctcc cagagaggcc gggcctgcag gagggccagt tctcatgagt 51120 
tcttatttgg cccctaccct ccaggctgtg gttctgaggt gggagacaga gcctgacctc 51180 
tgtttgtctt gttttgtctt tgcagcagcc cacccatgtg cccgtgacaa tggtggctgc 51240 
tcccacatct gtattgccaa gggtgatggg acaccacggt gctcatgccc agtccacctc 51300 
gtgctcctgc agaacctgct gacctgtgga ggtaggtgtg acctaggtgc tcctttgggg 51360 
tgatggacag gtacctgatt ctctgcctgc taggctgctg cctggcatcc ttttaaaatc 51420 
acagtccctg tggcatccag tttccaaagc tgattgtgtc ttcctttgcc ctcctttctt 51480 
ttctactatg tgcattcggt gctatgaatt ttcctctaag tactgcgttt cctgcatctc 51540 
acaaattttg ttacattttc attttcaggt agtttgaata tttttacact tctcctgaga 5 1600 
tgacatcttt ggctcatgtg ttatttagaa gtgttgctta gtttctaaag agttggggct 51660 
tttccagctg tctctctgca actgatttct aatttaattc tactgtagtc tgagagctta 51720 
ttttatatga tttctgttat tttaaatgtg ttgggtgtgg tgtttttgtt gttattgttt 5 1780 
ttgtgtcttt ttgttttgtt ttgcttcgtt tgttttgttt ttgagacagt gtcttgctct 5 1840 
gtcactcagg ctggagtgca atggcgcgat ctcagctcac cgcaacctct gcctcccggg 51900 
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tgtcacccag gctggagtgc cctggtgtaa tctcggctea ctgcaacctc tgcctccagg 56100 
gttcaatcga ttctcxtgcc tcagcctccc gagtagctgg gatgacaggt gcgcaccacc 56160 
atgcctggct aatttttgta tttttagtag agacagggtt tcaccatgtt ggccaggctg 56220 
gtctcgaact cctgacctca ggtgatccgc ccgcctcagc ctcccaaagt gctgggatta 56280 
caggcatgag ccaccgcgcc cggcctgagt tttcctttta tgaaggacct gcttggttgg 56340 
ttgcctgcca catgttgtca gcaccatggg cccaggactg ctgaggagct gttgatgccc 56400 
tcgctctccc agagccaccg gctctgttag ataattcaca tgcagtctgg ccactgtcct 56460 
acgtcctcat tcacaaagag cagacatttc gtagaagatg agggcctggg agtaacctcc 56520 
ctgcatgttt ttctataaag gcatagtggt taagtccttc cagctcattg accattggag 56580 
aattttatgg aggctgtaga ctaggggctg gtaaactaag ggcccagggg ccaaatccag 56640 
cctgccacct acttttgtaa ataaagtttt cttggtgcac agccatgccc attcattcat 56700 
ttgcacaatg tctgtggctg ctttcatgcc aaaagcagga gaactgsigtg gttatgctgg 56760 
agacctacgg ccttcaaagc cccagacctc acgtctggcc cttgacagac agagcttccc 56820 
cagccctgct gcgcatcctg gcccagcatg tgctgtgtgt gtgatttcag cttgcaggag 56880 
ccgtggttag gaattgtccc tgtgttggtc cattttgcat tgctatgaag gagcacctga 56940 
ggccgggtag attatgaagg aaagaggtct gtctggctca tggttctgta ggcagcacca 57000 
gtatggcacc cgcatctgct cagcttctag tgaggtctca ggaagctttg actcatggtg 57060 
gaagtcgaag cgggagcagg tgcatcacat ggtgagagag ggagcaacgg agagagagag 57120 
agagagagag agagcgcctc tccctcttgc cctcaccttg agaggagatg ccaggctcct 57180 
ttaagtaacc agctcccatg tgaactcaca gtgagagccc atttgctact gcggagaggg 57240 
caccaggcat ctgctcccat gacccaaaca ctgcccacca ggccctacct ccaaccttgg 57300 
ggtcatattt tattctgttc tatgctatgc tatgctatgc catgccatgc catgccatgc 57360 
taitcctatt ctattatttg agacagaatc tcgctctgtt gcccaggctg gagtgcagtg 57420 
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gcatg^t ggctoacgc aacctcca^c «ccaggttc aagcgatcc. cctgcCcag 57480 
cctcccgag. agctggga™ acaggcacac accaccacac ccgggtaatt mgtatttt 57540 
caatagagat ggggtttcac ca.g.ggcc aggCgg^t ca.c.c«g goctcaagtg 57600 
a.ccac«ac «cggcctcc caaagtgcca .gattacaga tgtgagtcac tgcgcccag. 57660 
gaggg.^amccgttgagamggaggggcagacgaggagcca.ctgagccccc.0 57720 

g.cccg«ctagcuc«acccgwgccccgcggtgctgg«gcaggccctmcgccg 57780 
g„o<ggc.goacgc.c,g<«agaagctttcttccagcttggttaccagaaaatcat 57840 
eeeat^acaaggacagggtccccoa^ccca^cecagggcaggaoaccggggg 57900 

oagggcaggt ggggaactga gcaagttc. .gggggcagg cg.ggc.a« gctccctcg 57960 
gg.gggcg.^gggaggggtggaggcagccg«agcgccc.ggc«gctcttcc,ccct 58020 

ggccagagactgtggcct,gtgagc.cccg«tgggc.gcctgcaoctccagtggg«g 58080 
^e.ccc«ccc«ccc,cccctcaag«c«c.gagcaccactgcct.ccacagccccc 58140 

aoctcggga ggogaggcc ctcg.ggcca ttcctg.cct .ggcacccac ™cca 58200 
acc«g«gagccttgggcgggg.ctgttac.cct.gca.ggcgugacctccccacag. 58260 

agg^ga «ca«acc« ctggggggca ggcaggagg. gcgfgagg. ccagccctg 58320 
gcag.ccc.cecc«cgtggcataggccmgcca«ggg.catcgaggg.gggtggagac 58380 

tgumgaccacucccgc«gtc«agaaaggg..ccatctgtctgactctgtttgg 58440 
ag.ecagaccttggttgc.gtgccctgca,ggtgggc.ggggggcaccc.ccagcc«t= 58500 

tgaglgcatg gcctccctt gcagccata gcctgcccaa ccagtt^g tgtgcgagcg 58560 
gccagtgW octcalcaaa «goag«cg actcct^ cgactgutc gacggcccg 58620 
acgagcoat gtgtggtgag ccagc«c« g^ggggaa ggggcgtccg ggctggg«c 58680 
eoccaggaao g«gagma ggggaggaga cgtgccmc cagcgggga gggggctgtg 58740 
tgggagactc aggcggctgg gaggc«ctt gcgggaggca gggaagcc« teccagggca 58800 
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ggcttatggg tggcgtgaat tagtcggggt ctatcaggag gcagaaactc tatgagaatt 65760 
tgaacagaga aagttccgtc tacaggctta ttaccaggga ctggaatagc agaaattgaa 65820 
cagtgagatg tacagagaac tctaagaatg caggaatagg ccaggcatgg tggctcacac 65880 
ctgtcatccc agcactttgg gagaccaagg cgggtggatc acctgaggtc aggagttcga 65940 
gaccagcctg gccaacatag tgaaacccca tctctactaa aaatacaaaa aaattagctg 66000 
ggtgtggtgg cgcatgcctg taatcccagc ttctcgggag tctgaggctg gagaatcact 66060 
tgaacctggg aggcagaggt tgtagtgagc cgagatcatg ccattgtact ccagcctggg 66120 
caacaagagc gagactcagt caaaacaaca acaacgcagg aatagcagat gagccgaggt 66180 
ggggcctccc cagcccccac cccccacccc gcaccctggg ccgagatcca gtcctctttg 66240 
aatagggcct gggcgtggtt cacgggacat ctgagacatt gccgaggcgc tgcactggtg 66300 
gatcttgcca gaagtctgcc cagtgcagat ttgggcagaa tctcaaactg ccttgggatg 66360 
taggagagaa accaggcctg gtcaagttca tgggaagagg tggaaacaga ccccataggc 66420 
tggggcttgg gcagctgtag gaagccctct ctgctgcctc cctgcctgct ctctgctttg 66480 
aagcatcttc cccagtgccc ccagtctcat gccctctcaa cgttggggtc. aaatcctgag 66540 
gaatacccag actggctctc tgggccaaag aggaccctct ccagaaagag cagggcccag 66600 
tgcggcttcc taaagggcag gggaagggcc tggccactcc ccagaggcta ctcaccagcc 66660 
atcaggatag ccccaggaag caggccttct cgagcccatt ttattacttt attttattat 66720 
tttatttaattttaaatttattttttgagacagagtctcactctgttgcccaggct^^^^ 66780 
tgcagtggtg cgatctcaac ccactgcagc ctctgcctcc agggttcaag ggattctccc 66840 

acctcagcctcccaagtagctgggattacaggtgcccgccaccacacccggctaattte 66900 
atatttttag tagagacgag gtttcaccat gttggccagg ctggtctcga actcctgacc 66960 
tcaagtgatc cgcccgcctc ggcctcccaa agtgctaggt caagcccatt ttaaagttga 67020 
agaaactgag gctgaggtaa attccctccc cagggatcct gctgcagcca gaaggtggta 67080 



wo 01/92891 PCT/USOl/16946 

153 

ggctcccagc tgccgtgtgc accctgccta gagctctacc gtaacccatc tccgggagga 64380 
ggtgctattg ttttcctcat tttgcaacaa ggaggctgaa gaactgagca tgaaccactg 64440 
gcctgggtcg ttcggttggt aggcagtggg gccaggccat ccaactcaca accaccttct 64500 
actctgcttc ccccgcaccc tgaagtttgt tctgttttga ggacacagcc gtcacattct 64560 
tggtggctga acagcactcc ttgtcaggtg tggctgggcc cccactggag ggcatcatgg 64620 
tcctctctcc tgctgcggtt gaaccttggc tgtttcaacc actcctgcca agtggccctc 64680 
tgaaagggac agtccatctt ttctcagcag agggccacac tggcaaaacg gtccctggca 64740 
ccctttctct ccacctgtct aatatagagt aaaaatggta tcatgttaag atcttcattt 64800 
atatttattt tatcatgaat gatgtaagca tcattttgtg tgtttaagaa cctttgggcc 64860 
cagcgtgatg gcttgcagct gtaatctcag cactttagga ggctgagatg agcggatcac 64920 
ttgaggccgg gagtttgaga ccagcctggc caacatggag aaaccccgtc tctagtaaaa 64980 
atttaaaaat tagccgggta tggtgatccc agctacttgg gagtctgaag catgagaatt 65040 
gcttgaacat gggaggcgga ggttgcagtg agccgagatc gcgccattgc actccagcct 65 100 
gggcgacaga gcgagactct gtctcacaaa aaaaaaaaaa aaagaaaaga aaagaaatta 65160 
tcaatctcct cttttatggc atatatatat atatatatat atatatatat ttatttccct 65220 
ttcttggtta tgttcataaa ggcctcccct gctctgatca taaaaaacaa cttattttca 65280 
cactctctct cttttttttt tgagacagag ttttgctcct gttgcccagg ctggagtgca 65340 
gtggcgcaat ctcagctcac tgtaacctcc gcctcccggg ttggagtgat tctcctgcct 65400 
taccttcccg agtagctggg attataggca tgcaccacca tgcctggcta attttgtact 65460 
tttagtagag acgggggttt ctccatgttg gtcaggctgg tctcgaactc gcgacctcag 65520 
gtgatccacc cacctcggcc tcccaaagtg ctgggattac agacgtgagc caccatgccc 65580 
agcccacact ctctttctta acgtcctcct cctttcgttt tacgttcaca tctttaattc 65640 
ttctgggatg taattagatt tgatgagcaa ggtgggcatc cagcttgttt cttggctgat 65700 
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cggggccggg gaggggcggg gcgggatggg gctgtgggcc cctcccaccg toagtgctgg 61620 
ccaccggagg cttcccgggt tcctgggggc tgtgccaccg cctctgaggc atgcttgctt 61680 
tcttcccttt tcaaaccctt ctgcttcctt ctttaatgac attgttgatt gtggataatc 61740 
tgaaaactac acaaaaatat aaagagccaa aatctcaccc aaatccacct cctagagtgg 61800 
ctgttgggct ccgtcagcat ccaggcggcc gtctgtgttc cgcacggccc agcccatcga 61860 
tagccgcctg caccaggcct gtctgccctc tgtgagcctc cccacagggt tccctccaca 61920 
aacaccctgt tctcccaccc agggctggct gcttcctgga aaacagctgg atggttttgt 61980 
gcatgacaga caaacacagg gtgattttcg tggctaaaat actccctgga gcttttggca 62040 
gggtgagggg ctggctccag ctgagccacg ccttgagtga aatgactgtg aggagaataa 62100 
actgccgctg ccctccagga tcactggggc tggctgggga gaacccccgt ttctgggagc 62160 
acagtcccag gatgccaagg cgagcttggt gccgagatgt gaactcctga gtgtaaacag 62220 
cgggggctga cttgacatgc tt^tgct mcatitgt tcctgcagct gtatgcccct 62280 
aaggtgagtc cagccccctt ctgcttcctc tggggcctcg ccagtgagcc ccaccttgct 62340 
ggggctggtt cctcctgccc ttctgggtat ccctcacatc tggggtcttg tcttcttgtt 62400 
ttatttttct tttttttttg agacggagtt tcacttttgt tgcccaggct tcagtgcaat 62460 
ggtgtgatct ctaggctcac cgcaacctct gcctcccagg ttcaagcagt tctcctgcct 62520 
cagcctccct agtagctggg attacaggca tgtgccacca cgcccagcta attttgtatt 62580 
tttagtagag atggggtttc tccatgttgg tcaggctgat cttgaactcc ctacctcagg 62640 
tgatccgccc accttggcct cccaaagtgc tgggattaca ggcgtgagcc accgcacctg 62700 
gcctttttct tttcttttct tttctttttt ctgagacagg gtctcgctct gtcacccagg 62760 
ctggagtgca atggtgtcat catggctaac tgcagcctct accttctagg ctcaagcaat 62820 
cctcccatct cagcccctaa gtagctagga ctgcacgcat gcatccccat gcccagctaa 62880 
tatttacatt ttttgtagag atgaagtttc actatattgc ccaggctggt ctccaactcc 62940 
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aaacaggact tcacccgggt ctgtctggcg tgaaaggcag tgtlcttgta ccaccctagg 67140 
gggcctgaga gaactgagtc cctcgggcat aactgacagt tctgttccca ttattccgca 67200 
ggggctcgga tctggctgta tgctttccag gatggccttg gagacccaca taagccctac 67260 
accctttggg aagctgcatg ttgggttggg gtgccgtcag tggcacttgt ggaaggtgca 67320 
gacctgtgtg ggtgtgtggg cccagggccc ctggtccctt cctccctttg tagggctggt 67380 
tgtgtgctgc ctggacctgg ggggcacgtt cacgtggtga atttgtctat ttactatccc 67440 
cgctttgggg ctggtgccag cacaggccct tgtgaagggg gtgcctttgt ctggagtggg 67500 
actgtggccc ctccctcagc gtggtgactt ctgtgtcagg gcttcagcag ggacgcagag 67560 
cccctgagtg ttcggaacaa gggcgtcatt gcaggagtta gactgtgtgt gatggaggga 67620 
ggaggggcag gaggaaaggt cagaaggaga gttcctggga aggtccctga ggagcctggt 67680 
gaggtgctaa ctggtgtgga ggacactcag ggcctgtggg gacatctcct actgctgggg 67740 
gccagccaca aagggaactg gccgaagtcc tgtccccgcc ttcacagccc agcatctggt 67800 
cacaaggcag gtacttggaa gggcgcgggc acctgggcca aaagtgcctg ggttcccttt 67860 
gcctttcact gagatgacct tcggggcagg tggctgctgc ctcccctcct gtccccaggt 67920 
tttgccaact ggccagagga aggggtcctg ggaagcaggg gggccagaag ccctctctgc 67980 
aaggaaagcc cgaggggtgt gggaggaagg aaggaatgcc caggctggcg aggctctaag 68040 
tcaccctggc ttggctctcc tcagatcctg aacccgccgc cctccccggc cacggacccc 68100 
tccctgtaca acatggacat gttctactct tcaaacattc cggccactgc gagaccgtac 68160 
aggtaggaca tcccctgcag ccctccatgg ccattgggtt cccgccagcc cgtggtggag 68220 
gggcctaatc cccatgccac tgatgagggg aggtattctg ggtgctagtg ggcaggtgcc 68280 
gggccc^cc ctgcctccct ctgctctgcc aaccacacta ggctgcctcc ccagacaagc 68340 
tcagcgggca ctgcatgttg ggttcagaaa tcagcagaac tccacgttct gagctgctct 68400 
toaagttgct cctatggggg ttacttttaa gctgggaaat ggq^tgtggcg tcgaggggcc 68460 
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ctaattaaag agacggggca gtggacaggc attttcagtt gactgcccag ggagtgttct 180 
gcccaacagg gaggatatgc gtacagaatc atactcgatc agcatgagtc caattcagac ^ 240 
cgtacatcag tggagatatg ggtcccccga tgactccgtg gaacactgat gtttgtgaca 300 
ggggagtaca gcaccagcca tcagcaggcc agtaaatcat accggcctgc gaaattggac 360 
tcagacccgg atccaccctg accgacgtcc caagccccca ccccccaccc cccaccatgg 420 
gccgagatcc agtcctcttt gaatagggcc tggccgtggt tcacgggaca tctgagacat 480 
tgccgaggcg ctgcattggt ggatcttgcc agaagtttgc ccagtgcaga tttgggcaga 540 
atctcaaact gcxttgggat gtaggagaga aaccaggcct ggtcaagttc atgggaagag 600 
gtggaaacag accccatagg ctggggcttg ggcagctgta ggaagccctc tctgctgcct 660 
ccctgcctgc tctctgcttt gaagcatctt ccccagtgcc cccagtctca tgccctctca 720 
acgttggggt caaatcctga ggaataccca gactggctct ctgggccaaa gaggaccctc 780 
tccagaaaga gcagggccca gtgcggcttc ctaaagggca ggggaagggc ctggccactc 840 
cccagaggct actcaccagc catcaggata gccccaggaa gcaggccttc tcgagcccat 900 
tttattactt tattttatta ttttatttaa ttttaaattt attttttgag acagagtctc 960 
actctgttgc ccaggctgga gtgcagtggt gcgatctcaa cccactgcag cctctgcctc 1020 
cagggttcaa gggattctcc cacctcagcc tcccaagtag ctgggattac aggtgcccgc 1080 
caccacaccc ggctaatttt catattttta gtagagatga ggtttcacca tgttggccag 1 140 
gctggtctcg aactcctgac ctcaagtgat ccgcccgcct cggcctccca aagtgctagg 1200 
tcaagcccat tttaaagttg aagaaactga ggctgaggta aattccctcc ccagggatcc 1260 
tgctgcagcc agaaggtggt aaaacaggac ttcacccggg tctgtctggc gtgaaaggca 1320 
gtgttcttgt accaccctag ggggcctgag agaactgagt ccctcgggca taactgacag 1380 
ttctgttccc attattccgc aggggctcgg atctggctgt atgctttcca ggatggcctt 1440 
ggagacccac ataagcccta caccctttgg gaagctgcat gtta^gttgg ggtgccgtca 1500 
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ctgtcatcca ggctggagtg cagtggtaca atctcagctc actgcaagct ccgactccca 71280 
ggttcaagtg agtctcctgc ctcagcctcc cgagtagctg ggactacagg tgcgcgccac 71340 
cacacccgcc cagctaattt ttgtattttt agtagagatg gggtttcacc atgttggcca 7 1400 
ggatgatctc gatctcttga cctcgtgatc cgcccacctc ggcctcccaa agtgctggga 71460 
ttataggcatgagccactgtacccagctgactcttagtcacttttaagaaggggactgtg 71520 

' ccttcatttt tcactgggcc ctgcagaata tatgcctggg ctctgggctc ttctgaacct 71580 
gtgttggctt ccatctgacc tctctgtgcc agcccaaggc tgctgctctt cctgagggca 71640 
aggagcccca tgactgcgtg ttgactcgct ggatggggct gctgagccca ctctgccaca 7 1700 
ccacgtgccc ctggcaggga gggaatccct gggtcctcac aggaacagtc agcaagccac 7 1760 
acctgacgcc tgctgtgggc ccatccctgc ggtgctggag aagacagaca aggcctggtc 7 1820 
actgcctctg cagggtcccc agtccgtgga aggagacagt aatctaggca ttttcggtgg 71880 
ggaagctgag ctgttctcgt gtcctgaagg ccaggcggga acagccgtct tcagagggaa 71940 
gggagaaaat gcacatcgca tcagtggaga agggcctgac ttccctcagc atggtggagg 72000 
gaggtcagaa aacagtcaag cttgagtatt ctatagtgtc acctaaata 72049 



<210> 10 
<211> 8705 
<212> DNA 
< 213 > Homo sapiens 

<400> 10 

ggactcaggg gcagcaggga ggtacaccca tggttagtgg gcggaccata gggggtaatg 60 
agagggtgaa tcgatggaac ctgggggaca caatcgaagt ggttccagag tcgggctgta 120 
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gaaccacccc catgattcaa tcatctccca ctgggtccct cccacagcac gtgggaatta 69900 
tgggagtaca attcaagatg agatttgggt ggggacacag ccaaacccta tcggttgcca 69960 
acatttacag taacagtgtt aggtgaacag ttgtccagtc tcctgttitg tcggacactg 70020 
tttctagcac cttccaggca gaatctcatg tatccttcac tttcgaaatg ggtactattt 70080 
catccccact tttatcaatg agaaactaaa gctcgaagag gtcaagtaag ttcctggcca 70140 
aggtcagcta gcaggctcta gaggcctcgt tctccttaga ggcagccttg ccagggccca 70200 
ggcttggcag gctgcagggc aggtgcgggc atgcccatgg tagaggtggg accattgagg 70260 
ctcagagagg gtaagtgatg agccctggcg acacagcggg gtgggtccag agtccggcct 70320 
gcatcttctg gagctggcca gtggacaggc ctttcccgtt cacagccccg gggctgctgt 70380 
gcccaccagg gcggatgtgc ctaccgaatc ccactcctct gtgtgtgtcc ctttcaggcc 70440 
ctacatcatt cgaggaatgg cgcccccgac gacgccctgc agcaccgacg tgtgtgacag 70500 
cgactacagc gccagccgct ggaaggccag caagtactac ctggatttga actcggactc 70560 
agacccctat ccacccccac ccacgcccca cagccagtac ctgtcggcgg aggacagctg 70620 
cccgccctcg cccgccaccg agaggagcta cttccatctc ttcccgcccc ctccgtcccc 70680 
ctgcacggac tcatcctgac ctcggccggg ccactctggc ttctctgtgc ccctgtaaat 70740 
agttttaaat atgaacaaag aaaaaaatat attttatgat ttaaaaaata aatqt?i?jttg 70800 
ggattttaaa aacatgagaa atgtgaactg tgatggggtg ggcagggctg ggagaacttt 70860 
gtacagtgga gaaatattta taaacttaat tttgtaaaac agaactgcca ttcttttgtg 70920 
ccctgtgtgc atttgagttg tgtgtccccg tggagggaat gccgaccccc ggaccaccat 70980 
gagagtcctc ctgcacccgg gcgtccctct gtccggctcc tgcagggaag ggctggggcc 71040 
ttgggcagag gtggatatct cccctgggat gcatccctga gctgcaggcc gggccggctt 71 100 
tatgtgcgtg tggcctgtgc cgtcagaaag ggccctgggc ttcatcacgc tgttgctgtt 71 160 
cgtcttcctc agattcttag tctttttttt tttttttttt ttttgagacg gagtctttct 71220 
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gcggccagga ggacagactg tgagctgtgg gctcggcggc tacagagtct gcctcagtgg 58860 
gcggggctga tggtgtccag gtgcctgcag cacgcaccca cccacgggac cttgctgagc 58920 
agcgtctgtc aggcagcaag attacccgag ggctgcagtg gtcctgttcc ctggcagctt 58980 
actgtctggc tgaggaggag tgatgttcac atatgcacac atgtcatgtg cacacacatg 59040 
tacatgacaa catcccacat gctcctcaaa tagcatgacc tgtacagtca cggatatagg 59100 
gcctagggga taggaggcca agacagtcag ggaagacttt ccagaggcag tggctcctga 59160 
aaggctgtct gattcaggca ggaagggagc tgagttcaga taggaagtag caatgagtca 59220 
ttgtgtctgg ggacatggcc actccttcgc tgcagaggga cctgggctga gagctcctct 59280 
cttatggctg cagtcgggag agaagtctgt tggggggaga agggggcttc ctcaagggac 59340 
tccctgtgcc ctttggcacc ttcgtgccag gtcaggcttg aggcctgaag gcagtggtgg 59400 
gggccaccaa gggtcgcctc ctctgctggg caagttccca gtctgacggg cctgtgccgt 59460 
gggccccagc tgtgggggcg ctgttgatgc gcagccaggc ctcgccgcca gagcccgcac 59520 
gcttccattc cgctgacttc atcgacgccc tcaggatcgc tgggccggcc ctgtgggaga 59580 
gtgaatgtgg cttttgccaa agttgagtct ggagcctgga aacttcccta tgggcagcct 59640 
tgatagtgga gtggcccaag gagcccaccc agccgaccct gcccctcccg tggctggtgg 59700 
gcggcaccag gggctgcctg gctttgctcg ttcaccaaca tcacccgggc tggccagggc 59760 
gcgctcactt ctgccaccac cgagggccct gggcgaagga gtgaatacca ggctgccttg 59820 
gcagggatgt gttgagggct gtggggagtc ggacagcggc gggggtcaga ggaggaggag 59880 
ggtgcaccgt gcaggctgaa gggccacgtt accctgaggt tggccaggct ccccaggcct 59940 
agcctcccag ctcccccact ttctccccac cctccaccag tggcaaagcc agccccttca 60000 
gggcgcacgg tgtctgcccc caaggagggc ccattccgtt ggggttaatg ttggccacct 60060 
ctttctgttt gtctctggca gaaatcacca agccgccctc agacgacagc ccggcccaca 60120 
gcagtgccat cgggcccgtc attggcatca tcctctctct_cttcgtcatg ggtggtgtct 60180 
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gtggcacttg tggaaggtgc agacctgtgt gggtgtgtgg gcccagggcc cctggtccct 1560 
tcctcccttt gtagggctgg ttgtgtgctg cctggacctg gggggcacgt tcacgtggtg 1620 
aatttgtcta tttactatcc ccgctttggg gctggtgcca gcacaggccc ttgtgaaggg 1680 
ggtgcctttg tctggagtgg gactgtggcc cctccctcag cgtggtgact tctgtgtcag 1740 
ggcttcagca gggacgcaga gcccctgagt gttcggaaca agggcgtcat tgcaggagtt 1 800 
agactgtgtg tgatggaggg aggaggggca ggaggaaagg tcagaaggag agttcctggg I860 
aaggtccctg aggagcctgg tgaggtgcta actggtgtgg aggacactca gggcctgtgg 1920 
ggacatctcc tactgctggg ggccagccac aaagggaact ggccgaagtc ctgtccccgc 1980 
cttcacagcc cagcatctgg tcacaaggca ggtacttgga agggcgcggg cacctgggcc 2040 
aaaagtgcct gggttccctt tgcctttcac tgagatgacc ttcggggcag gtggctgctg 2100 
cctcccctcc tgtccccagg ttttgccaac tggccagagg aaggggtcct gggaagcagg 2160 
ggggccagaa gccctctctg caaggaaagc ccgaggggtg tgggaggaag gaaggaatgc 2220 
ccaggctggc gaggctctaa gtcaccctgg cttggctctc ctcagatcct gaacccgccg 2280 
ccctccccgg ccacggaccc ctccctgtac aacatggaca tgttctactc ttcaaacatt 2340 
ccggccactg cgagaccgta caggtaggac atcccctgca gccctccatg gccattgggt 2400 
tcccgccagc ccgtggtgga ggggcctaat ccccatgcca ctgatgaggg gaggtattct 2460 
gggtgctaat gggcaggtgc cgggcccagc cctgcctccc tctgctctgc caaccacact 2520 
aggctgcctc cccagacaag ctcagcgggc actgcatgtt gggttcagaa atcagcagaa 2580 
ctccacgttc tgagctgctc ttcaagttgc tcctatgggg gttactttta agctgggaaa 2640 
tggctgtggc gtcgaggggc cgggggcttg ggctccagag tctgactgtg tgtttgagtc 2700 
cggctgtgga aacctagcca ttgagatgcc ccctcttggt ggctcLgtcc tcttaggatg 2760 
ggacaagtct gtgaaggctg ctgcagcacc caccgtagac ccctaatcgt gtgacgtcac 2820 
caggatggtc cgggctgctc acttgccaca gtggcctgtt tgagcccggg aagccaacgg 2880 
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tcggcctccc aaagtgctgg gattataggc atgagccact gtacccagct gactcttagt 5700 
cacttttaag aaggggactg tgccttcatt tttcactggg ccctgcagaa tatatgcctg 5760 
ggctctgggc tcttctgaac ctgtgttggc ttccatctga cctctctgtg ccagcccaag 5820 
gctgctgctc ttcctgaggg caaggagccc catgactgcg tgttgactcg ctggatgggg 5880 
ctgctgagcc cactctgcca caccacgtgc ecctggcagg gagggaatcc ctgggtcctc 5940 
acaggaacag tcagcaagcc acacctgacg cctgctgtgg gcccatccct gcggtgctgg 6000 
agaagacaga caaggcctgg tcactgcctc tgcagggtcc ccagtccgtg gaaggagaca 6060 
gtaatctagg cattttcggt ggggaagctg agctgttctc gtgtcctgaa ggccaggcgg 6120 
gaacagccgt cttcagaggg aagggagaaa atgcacatcg catcagtgga gaagggcctg 6180 
acttccctca gcatggtgga gggaggtcag aaaacagtca agcttgttgc tgggtgacag 6240 
tgcatttaat aatcaaaata taggctgggt acggtggctc atgcctgtaa tcccagcact 6300 
ttgggaggct gaggcaggtg gatcacttga ggccaggagt ttgagaccgg cctggccaac 6360 
atggcaaaac ctcaactact aaaatacaaa aactagccgg gcgtggtggt gcacgcctgt 6420 
aatcccagct acttgggagg ctgaggcagg agaattgctt gaacctggga ggcggaggct 6480 
gcagtgagcc gagattgtgc cactgcactc cagcctgggc aacagagcaa gactctgtct 6540 
caaaaaaaaa ff^aaaaaaaa gcaatacaaa atacaaatat cactttcact aaaagaaggg 6600 
atggaagacc caaaacaaac agaaaacaac aaaatggcag gagtaagtcc ccacttatca 6660 
, ataataacat tgactgtaaa taggctaagc tctgcaatca aaagagtggg ccaggagcgg 6720 
tggctcacgc ctgtaattcc aacgctttgg gaggctgagg cggatggatc atttgatgtc 6780 
acgagtttta agaccagcct ggccaacaag gtgaaacccc atctgtacta aaaatacaaa 6840 
aattagccag gcggtagtgg cacgcacctg taatcccagc tacttgtgag gctgaggcag 6900 
gagaatcact ggaggctggg aagcggaggt tgctgtgagc caagatggag ccactgcact 6960 
cccacctggg cgacagagtg agatcctgtc ttaagaaaaa aaagagtgga tgaatggatc 7020 



wo 01/92891 

PCTAJSOl/16946 

162 

gtaatgaacg tgaccaggag ctgcttactg aggacgcact ggatgatctc atcccttctt 660 
ttctactgac tggtcaacag acaccggcgt tcggtcgaag agtatctggt gtcatagaaa 720 
ttgccgatgg gagtcgccgt Ggtaaagctg ctgcacttac cgaaagtgat tatcgtgttc 780 
tggttggcga gctggatgat gagcagatgg ctgcattatc cagattgggt aacgattatc 840 
gcccaacaag tgcttatgaa cgtggtcagc gttatgcaag ccgattgcag aatgaatttg 900 
ctggaaatat ttctgcgctg gctgatgcgg aaaatatttc acgtaagatt attacccgct 960 
gtatcaacac cgccaaattg cctaaatcag ttgttgctct tttttctcac cccggtgaac 1020 
tatctgcccg gtcaggtgat gcacttcaaa aagcctttac agataaagag gaattactta 1080 
agcagcaggc atctaacctt catgagcaga aaaaagctgg ggtgatattt gaagctgaag 1140 
aagttatcac tcttttaact tctgtgctta aaacgtcatc tgcatcaaga actagtttaa 1200 
gctcacgacatcagtttgctcctggagcgacagtattgtataagggcgataaaatggtgc 1260 
ttaacctggacaggtctcgtgttccaactgagtgtatagagaaaattgaggccattctta 1320 
aggaacttga aaagccagca ccctgatgcg accacgtttt agtctacgtt tatctgtctt 1380 
tacttaatgt cctttgttac aggccagaaa gcataactgg cctgaatatt ctctctgggc 1440 
ccactgttoc acttgtatcg tcggtctgat aatcagactg ggaccacggt cccactcgta 1500 

tcgtcggtct gattattagt ctgggaccac ggtcccactc gtatcgtcgg tctgattatt 1560 
agtctgggac cacggtccca ctcgtatcgt cggtctgata atcagactgg gaccacggtc 1620 
ccactcgtat cgtcggtctg attattagtc tgggaccatg gtcccactcg tatcgtcggt 1680 
ctgattatta gtctgggacc acggtcccac tcgtatcgtc ggtctgatta ttagtctgga 1740 
accacggtcc cactcgtatc gtcggtctga ttattagtct gggaccacgg tcccactcgt 1800 
atcgtcggtc tgattattag tctgggacca cgatcccact cgtgttgtcg gtctgattat 1860 
cggtctggga ccacggtccc acttgtattg tcgatcagac tatcagcgtg agactacgat 1920 
tccatcaatgcctgtcaagggcaagtattgacatgtcgtcgtaacctgtagaacggagta 1980 
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ggtgggtgct tcgtggcgtg gttctgaaac ttcgttggaa gtgtgtggac agtgccttgc 3420 
ctgttctctg tgggacccta tttagaaacg aggtctgagt tactgggggt catcactgtg 3480 
ttctgatggc ccagctgtgt ggaggccgcg gtgcagcccc atccaaggag ccagggccct 3540 
gggtctagcc gtgaccagaa tgcatgcccc ggaggtgttt ctcatctcgc acctgtgttg 3600 
cctggtgtgt caagtggtcg tgaaactctg tgttagctct tggtgttcct gaaagtgccc 3660 
ccgggtctca ggcctcagaa ccagggtttc ccttcatctc ggtggcctgg gagcatctgg 3720 • 
gcagttgagc aaagagggcg attcacttga aggatgtgtc tggccctgcc taggagcccc 3780 
ccggcacggt gctggggcct gaagctgccc tcgggtggtg gagaggaggg agcgatgaag 3840 
tggcgtcgag ctgggcagga agggtgagcc cctgcaaggt gggcatgctg gggacgctga 3900 
gcagcatggc cagcagctgg gtctgcagcc tggtacccgg cgggacttgt ggttggggct 3960 
ggtttgtggc caggagaggg gctggcagga gacaaggggg actgtgaggc agctcccacc 4020 

cagcagctga agcccaatgg cctggctgtg tggctctcag ctgcgtgcat aacctctcag 4080 
tgcttcagtt ctctcatttg taaaatgagg aaacaaacag tgccagcctc ccagaggtgt 4140 
catgaggatg aacgagtgac catgtagcat gggctgggtg cgtgtcacct aacatcacca 4200 
gcctttgcaa ggagagccct gggggcx;tgg ctgagtattt cccttgcccg gcccacccca 4260 
ggcctagact tgtgcctgct gcaggccctt gacccctgac cccattgcac ctgtctccac 4320 
aggagccgag gaggtgctgc tgctggcccg gcggacggac ctacggagga tctcgctgga 4380 
cacgccggac ttcaccgaca tcgtgctgca ggtggacgac atccggcacg ccattgccat 4440 
cgactacgac ccgctagagg gctatgtcta ctggacagat gacgaggtgc gggccatccg 4500 
cagggcgtac ctggacgggt ctggggcgca gacgctggtc aacaccgaga tcaacgaccc 4560 
cgatggcatc gcggtcgact gggtggcccg aaacctctac tggaccgaca cgggcacgga 4620 
ccgcatcgag gtgacgcgcc tcaacggcac ctcccgcaag atcctggtgt cggaggacct 4680 
ggacgagccc cgagccatcg cactgcaccc cgtgatgggg taagacgggc gggggctggg 4740 
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acctcggtgt gcggttgtat gcctgctgtg gattgctgct gtgtcctgct tatccacaac 2040 
attttgcgca cggttatgtg gacaaaatac ctggttaccc aggccgtgcc ggcacgttaa 2100 
ccgggctgca tccgatgcaa gtgtgtcgct gtcgacgagc tcgcgagctc ggacatgagg 2160 
ttgccccgta ttcagtgtcg ctgatttgta ttgtctgaag ttgtttttac gttaagttga 2220 
tgcagatcaa ttaatacgat acctgcgtca taattgatta tttgacgtgg tttgatggcc 2280 
tccacgcacg ttgtgatatg tagatgataa tcattatcac tttacgggtc ctttccggtg 2340 
atccgacagg ttacggggcg gcgacctcgc gggttttcgc tatttatgaa aattttccgg 2400 
tttaaggcgt ttccgttctt cttcgtcata acttaatgtt tttatttaaa ataccctctg 2460 
aaaagaaagg aaacgacagg tgctgaaagc gagetttttg gcctctgtcg tttcctttct 2520 
ctgtttttgtccgtggaatgaacaatggaagtccgagctcatcgctaataacttcgtata 2580 
: g^tacatta tacgaagtta tattcgatgc ggccgcaagg ggttcgcgtc agckggtgtt 2640 
ggcgggtgtc ggggctggct taactatgcg gcatcagagc agattgtact gagagtgcac 2700 
catatgcggt gtgaaatacc gcacagatgc gtaaggagaa aataccgcat caggcgccat 2760 
tcgccattca ggctgcgcaa ctgttgggaa gggcgatcgg tgcgggcctc ttcgctatta 2820 
cgccagctgg cgaaaggggg atgtgctgca aggcgattaa gttgggtaac gccagggttt 2880 
tcccagtcac gacgttgtaa aacgacggcc agtgaattgt aatacgactc actatagggc 2940 
gaattcgagctcggtacccggggatcctctagagtcgacctgcaggcatgcaagcttctc 3000 
ttgtgccggttgtacgctgtcaggtcacactggtgagttaggcagggcacagatgcccag 3060 
agcagaggga actttccttg gggattcaac acgtgcaagt cttaggggct ggcaaatcct 3120 
gccctcagct agagaggggg cttttatttg agaccagaat cacctgagca tcctcctgtc 3180 
cc^gctgtgtccagcctgtctgcagggacatcctgagaggaccaggctctcccctcatc 3240 
cacctgcctaagtgccactctgaaccctgtccacctgtgccgtggaggggcgtgaccte^ 3300 
agctgctcag ccagcagcag gcttggccct ggggggcagc agagacccag gtggctgtgg 3360 



BNSDCCID: <WO oib^roiap i > 



PCTAJSOl/16946 

WO 01/92891 

gccggagcc agggccaggc caagcacgg cg^aggE^g afg^-tgg ac=tg.catt 4800 
agggaeae. g.ct«ca,c agaacccgga ggagggcttg ttaaaacac. ggcagctggg 4860 
ccccacoccc agagcggtga ttcaggagC ccagggcggg gctgaagac. tgggtttcm 4920 
acaagcaccc cagtggt.cg gtg«gc.go .gggtccatg cgtagaaagc cctggagacc 4980 
tggagggagoccmgttcccctggcttcagmccK^atctgtagaatggaacggtcca 5040 

tctgggtgat ttccaggatg aeagtagtga cagtaagggc agcctctg.g a«ctgacca 5100 
cagtacaggccaggccttttmttcmtttttttmgagatggagK.cactc.gtc 5160 

gcccaggctg gag«!cagtg gtgtgatctc agcttactac aacctctgcc ttCgggctc 5220 
aagtgattct cctgcctcag cc.cc.gagt agctgggatt acaggtgcc. gccactgtgc 5280 
ttggctaatgtttgtatttttggtagagatggggmcaccgtcttggccaggctggtcg 5340 
caaactcctgacctcaggtgatccacctgcc^^agcctcccaaagtgctgggattacagg 5400 
catgagccac cacgcccggt caggccaggc ccttttgaa cacmgcac accatgggB 5460 
muatcca ggggggmgg ucagttgu cagttgagga cactgaagcc cagagaggct 5520 
cagggacttgcccaggg«:acacagcagga.g.ggcagg.g.ggggc.gggcc.ggcagc 5580 

gtggcttcag cmccagca tagaaa«*g tgaaagcaga tagmgttg gttggtaggg 5640 
"gagacttB. gagacccgcc ccagcggctc agagggtag. agccaggggc cttcctgggg 5700 
gctcataacc cagaacacg aatgggaaaa cccgatgga ggaggcgcag tggagctgtg 5760 
ggtgccgatgggaagtcccagaggagctgggaggtcagtagcggtgc.gccc.c.g.gga 5820 

gcacttagtg ggcaccaggt gtgmccag gttcawgcc ctgggacctg aagctt^agaa 5880 
ggtgaagtaa cttgcccagg gcacccgtcg ggcagcggcg ggcagaggat ttg.gggctg 5940 
,gg^cc.gtgctt:gtggcccagccagggggng.gag.g.g«K'==ggS8''8'^«" ^ 
cc.gcaag.g gactggtgtc utggagccag ca.gtcaggc agcaggcagc gggagtgcag 6060 
caggcagcgg gagcacagca ggcagagggc ggggc.cga^ =agcca.ccg tggaccctgg 6120 
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ggcacggagg catgtgggag agggctgctc catggcagtg gctgaagggc tgggttgtgc 6180 
cccgaggagg gtggatgagg gtaagaagtg gggtccccag gggctttagc aagaggaggc 6240 
ccaggaactg gttgccagct acagtgaagg gaacacggcc ctgaggtcag gagcttggtc 6300 
aagtcactgt ctacatgggc ctcggtgtcc tcatctgtga aaaaggaagg gatggggaag 6360 
ctgactccaa ggcccctcct agccctggtt tcatgagtct gaggatccca gggacatggg 6420 
cttggcagtc tgacctgtga ggtcgtgggg tccagggagg ggcaccgagc tggaagcggg 6480 
aggcagaggg gctggccggc tgggtcagac acagctgaag cagaggctgt gacttggggc 6540 
ctcagaacct tcacccctga gctgccaccc caggatctgg gttccctcct tggggggccc 6600 
cagggaacaa gtcacctgtc ctttgcatag gggagccctt cagctatgtg cagaaggttc 6660 
tgctctgccc cttcctccct ctaggtgctc agctcctcca gcccactagt cagatgtgag 6720 
gctgccccag accctgggca gggtcatttc tgtccactga cctttgggat gggagatgag 6780 
ctcttggccc ctgagagtcc aagggctggt gtggtgaaac ccgcacaggg tggaagtggg 6840 
catccctgtc ccaggggagc ccccagggac tctggtcact gggcttgccg ctggcatgct 6900 
cagtcctcca gcacttactg acaccagcat ctactgacac caacatttac aaacaccgac 6960 
attgaccgac accgacattt accgacactg acatttacca acactgttta ccaacactga 7020 
catctactga cactggcatc taccaacact gacatttacc gacactgaca tttaccaaca 7080 
ctatttacca acactgacat ctactgacat tggcatctac caacaccaac atttaccgac 7140 

accaacatttaccaacactgaaatttaccgacaccgacatttaccgacaccgtttaccaa 7200 
caccgacgtt taccgacacc gacatttacc gacactgata tttaccaaca ctgacatcta 7260 

ctgacgctgg catctactga caccgatgccagcatctaccaacaccgaca tttaccaaca 7320 
ctgacatttactgacactgatatctactga cactggcatc tactgacaccaacatttacc 7380 
aacaccagcatctaccaacaccgacatttaccaacaccagcatttaccaacaccgatgtt 7440 
taccaacgcc gacgtttacc gacgccagca tctaccaaca cteacattta ccgacaccga 7500 
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catttaccga cactgacatt tactgacact gacatctact gatactggca tctaccgaca 7560 
ctgatattta ccaacgccag catctactga cactgatgtt taccaacacc gacatttacg 7620 
agcaccgaca tttactgaca ccaatattta ctgacatcaa catttagcca tgtgatgggg 7680 
gccggcttgg gggcaggcct tgctcttggc actggggatg ctgcagagac cagacagact 7740 
catggggtca tggacttctg cttcttctcc agcctcatgt actggacaga ctggggagag 7800 
aaccctaaaa tcgagtgtgc caacttggat gggcaggagc ggcgtgtgct ggtcaatgcc 7860 
tccctogggt ggcccaacgg cctggccctg gacctgcagg aggggaagct ctactgggga 7920 
gacgccaaga cagacaagat cgaggtgagg ctcctgtgga catgtttgat ccaggaggcc 7980 
aggcccagcc accccctgca gccagatgta cgtattggcg aggcaccgat gggtgcctgt 8040 
gctctgctat ttggccacat ggaatgcttg agaaaatagt tacaatactt tctgacaaaa 8 100 
acgccttgag agggtagcgc tatacaacgt cctgtggtta cgtaagatgt tatcattcgg . 8 160 
ccaggtgcct gtagacacag ctacttggag actgaggtgg gaggatcgct ggagtccaag 8220 
agtttgaggc cagcccgggc aaaggggaca caggaatcct ctgcactgct tttgccactt 8280 
actgtgagat ttaaattatt tcacaataca aaattaagac aaaaagttaa tcacatatcc 8340 
actgccctgc ttaagacaga aaacatgggt gttgttgaag ccagaggcag ctgctggcct 8400 
gagtttggtg attggttcct aagcagttga aggcagtttt gtttttccat agatgtctgt 8460 
tctccctttg ctgggtgcag cctcgccctg ctgctgtggt cgggtttcag tggcctcgtc 8520 
ccgtggacgc agcctcgccc tgccgctgtg gtcgggffic agtggcctcg tcccgtggac 8580 
gcagcctcgc cctgctgctg tggtcgggtt tcagtggcct cgtcccgtgg acgcagcctc 8640 
gccctgccgc tgtggtcggg tttcagtggc ctcgtcccgt ggacgcagcc tcgccctgcc 8700 
gctgtggtcg ggtttcagtg gcctcgtccc atgggcgtgc tttggcagct ttttgctcac 8760 
ctgtggagcc tctcttgagc ttttttgttt gttgtttgtt tttgtttgat tttgtttgat 8820 
tgtttgtttt tgttgtcgtt gttgttgccc aggctggagt gcagtggcgc gatctcagct 8880 
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cactgaaacc tctgcctcct tgggttcatg ccattctcct gcctcagcct cccacatagc 8940 
tgggattaca agtgcccgcc accacgcctg gctaaatttt gtatttttag tagacagggg 9000 
gtttcaccat gttggtcagg ctggtctgga actcctggtc tcacatgatc cacctgcctc 9060 
ggcctcccaa agtgttggga ttacaggcgt gagccaccgc gcccagcctc tgttgagcat 9120 
attttgaggt tctcttggtg ccagtgatat gtacatgtgt ccccatcgca ccatcgtcac 9180 
ccattgaggt gacattggtg cctctcctcg gggtggatgc ctccctctgt ttccagcaac 9240 
ttctgaagga ttttcctgag ctgcatcagt ccttgttgac gtcaccatcg gggtcacctt 9300 
tgctctcctc agggctccca ggggaggccc gaatcaggca gcttgcaggg cagggcagga 9360 
tggagaacac gagtgtgtgt ctgtgttgca ggatttcaga ccctgcttct gagcgggagg 9420 
agtctcagca ccttcagggt ggggaaccca gggatggggg aggctgagtg gacgcccttc 9480 
ccacgaaaac cctaggagct gcaggtgtgg ccatttcctg ctggagctcc ttgtaaatgt 9540 
tttgtttttg gcaaggccca tgtttgcggg ccgctgagga tgatttgcct tcacgcatcc 9600 
ccgctacccg tgggagcagg tcagggactc gcgtgtctgt ggcacaccag gcctgtgaca 9660 
ggcgttgttc catgtactgt ctcagcagtg gttttcttga gacagggtct cgctcgctca 9720 
cccaggcgag agtgcagtgg cgcaatcacg gctcgctgta gcctcaatct ccctgggctc 9780 
aggtgatect cctgcctcac cctctgagta gctgggacta cagacacata ccaccacacc 9840 
cagctagttt ttgtgtattt tttgtggggg gagatggggt ttcgctgtgg tgcccaagct 9900 
gatctcaaac tcctgaggca caagcgatcc acctgcctcg gcctcccaaa gtgctgggat 9960 
gacaggcatc agccgtcaca cgcagctcaa tgattttatt gtggtaaaat aaacatagca 10020 
caaaattgat gattttaacc attttaaagt gaacagttca ggctgggcgt ggtggcttat 10080 
gcttgtaatc ccagtacttt gagaggctga ggtgggcaga tcacctgagg tcaggagttt 10140 
gagaccagcc tggccaacat gatgaaatcc agtctctact aaaaatacaa aaattagccg 10200 
ggcatggtgg caggtgcctg taatcccagc tactcgggag gctgaggcag gagaatcgct 10260 
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tgagcccgggaggtggaggttgcagtgatctgagatcatgccactgcactccaatctgtg 10320 
tgacagagca agactctgtc ttgaaaaata aataaataaa aaaaatttta aaaagtgaac 10380 
aattcagggc atttagtatg aggacaatgt ggtgcaggta tctctgctac tatctacttc 10440 
tagaacactt tcttctgccc tgaaggaaac cccatgccca ccggcactca cgcccattct 10500 
cccctctctc ccagcctctg tcaaccacta atctactttc tgtctctggg ggttcacttc 10560 
ttctggacgt tttgtgtgac tggaatcctg caatatgtgg tccctgcgtg tggcttcttt 10620 
ccatagcatt gtgttttcca gattcaccca cacattgtcg cacgttatca gaatctcatt 10680 
cctgactggg tgcagtgggt taggcctgta atcctaacat tctgggaggc caaggcggga 10740 
cgatcacttg aggcaggagt ttgagaccag cctggccagc ctagcaagac cccagctacc 10800 
aaaaaatttt aaaagttaac tgaacgtggt ggtggtgggc acttgtggtt cccagctacc 10860 
tgggaggctg aggtgggagg atcgcttaag cccaggaggt caaggctgca gtgagctatg 10920 
atcgcaccac tgcactccag cctggacaac agagcaagac cctgtctgaa aaaaaaaaca 10980 
aaaaaaaaag ttcctttctt tttgtggctg gatgacatcc cattgtatgg ccacagcaca 1 1040 
ttttgtttgt ctgtttatcg ggtggtgggc agtggtttcc accttttgtc tcctgtgaat 1 1 100 
aatgctgctg tgaacatttg aattcaagtt tt^gaa cacctgttgt gaattatttg 1 1 160 
gatatatgtg taggggtagg attgctgagt cctatggtaa tgttaggttt gacttactga 1 1220 
ggaaccatta aactgttttc aacagtggct gcgccgttct gcatccccac cggcagtgtg 1 1280 
tgagggttct gactttacct cctcacaaac gcttcttttc catttaaaaa aatattcagc 1 1340 
caggtgctct ggctcacgcc tgtaatccca gcactttggg aggccgtggc gggcggatca 1 1400 
cctgaggtca ggagttcgag acgagcctgg ccaacatggt gtaaccccat ctctaccaaa 1 1460 
aatataaaaa ttagccgggt gtggcagcgg gcgcctgtaa tcccagctac ttgggaggct 1 1520 
gaggcaggag aatcacttga acccgggagg cagaggttgc agtgagccaa gatcgcgcca 1 1580 
ctacactcca gcctgggtga caagagtgaa aetccatcta aaataaaaca aaaataaaaa 1 1640 
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tccagcacac actcaccctg tctgtgcacc tgtttttgtg tccgtaagtg ggtatttact 32640 
caccttacga gtgagccact gtgggaattc agggaggtgg cgcagtgacc acccctggag 32700 
ggatatgtgt gtggcagggg tcgagggtct cgcccttccc tgcttcctgc gcgtggcttt 32760 
ctccaggacg gggagggctg agctgaagag gtggggacag ttgcgtcccc ccgccaccca 32820 
ctgtcctgcg gtgagagcag actcactgag cctgcccttc tcccttgtgc cttccagcta 32880 
catctactgg accgagtggg gcggcaagcc gaggatcgtg cgggccttca tggacgggac 32940 
caactgcatg acgctggtgg acaaggtggg ccgggccaac gacctcacca ttgactacgc 33000 
tgaccagcgc ctctactgga ccgacctgga caccaacatg atcgagtcgt ccaacatgct 33060 
gggtgagggc cgggctgggg ccttctggtc atggagggcg gggcagccgg gcgttggcca 33 120 
cctcccagcc tcgccgcacg taccctgtgg cctgcaagtt ccccaacctg gcaggagctg 33180 
tggccacacc cacgactgcc cagcagcctc accctctgct gtgggagttg tccccgtcca 33240 
cccctgggtg cctttgctgc agttatgtcg ggagaggctK tggtgacagc tgtttcctgt 33300 
gcacctgctg ggcactaggt cccagctaat ccctgtgcca ggactctaat ttcaccctaa 33360 
cacacatggt ggttttcatt gctggggaag ctgaggcctg agcacatgac ttgccttagg 33420 
tcacatagct ggtgagttca ggatccccca gagataccag ggccagcact cgatccccac 33480 
ccagccctga accccaccat gtgctgggat tgtgctggga gtgtccacac gcctgggacc 33540 
ccagggctgg tgctctcatc tcctttttcc agatcatgag aatgaggctc agggaagttt 33600 
gaaaaaaacc tatcccaagt cacacagcaa caggagcagg atttgaaccc agaaaagggg 33660 
accgcacact ctgttctgct agagtagtta gctgtcctgg gtgatatggc aggtgacagg 33720 
ggcaactgtg cttaacaaag gaacccccat cccccctgcc aagttgggag actagaaggt 33780 
caggggcaga agctctgaag ggccaggtgc agtggctgac acctctaatc ccagcacttt 33840 
gtgaggccaa ggcgggcaga tgatttgagc ccaggagttc aagatcagcc tgggtaatgt 33900 
agtgagacgc catctctaca aaaaaatttt ttaaaaatta gctgggcatg gtggttcatg 33960 



WOOl/92891 



ecgtagtoc aagctacttg ggaggctcag gtgggaggat tgcttgagcc caggaggttg 34020 
aggngtggt gagctgtgat ca«cca«g -Cccagcc .gggcaatag agtgagaccg 34080 

aaaaaaaaga agaaga^a gaagCctga ggctccaag. ccccaggcac 34U0 
ccc„ggc« gagggoagac aagggaggag aggg^acct gggcagccct gacmtgtc 34200 
ecc.ggcaaagggacc«cag.gacc«gg™agcCc.gagcacg.cagcoa 34260 
.gtcgaaccgccaggaagg gcagcaagaa «ggot.« gacctcgcc totcctaCc 34320 
gccatCca c«ggtg.gg Wgcccat ,».agatg aggag^gg ggeatcgacc 34380 
agcgaatgc c«g.cccag g«otg.g.a ggcagagcg gcagttgaac cccg.g.c« 34440 
ggttgtcgct gggggtgggc .gcaccctga cugtgaggc cag^gcaag gmgcacg, 34500 
ga<:ttcgtgaccg.ca=ccagctctgcagcaca.ccog.gacccagc.oa.ccaggccgc 34560 

a^caaacc, gttgccaggc gagaaaccag .oaccgcaca g«gtgg«g 34620 
,^gc=atm.cacc<xggagtgaggacagac^gatgaaaaccagcaaaagccct 34680 

ggaaac.oa.g.gaccctgccaa.gagggcggcca.gtgcattgcagcctggccg.cact 34740 

cctcgg.acgtg«.ggacaaaacg«ccgga««.ac.gagtg«.gat.aataac 34800 

atggaaggcc tggtct^t. gctgtgggag .gaagga^c acagccaggc ctgacatgat 34860 
g.gaacaagaacc.ggagtc.ogctgcctggg«gmtcc«gccc«ccact.agcaa 34920 

c.gtgtgactg.agccaggtcacttaamtg«agatcctgcctgcg«.cag.gga.c 34980 
,««gg«tcoaaggtggccaaacact.taaggcaaca.gtggtcgctaggotgcag 35040 
ggttgaaccc .ggccaccc cgcagggcgc cgWgctct g.ggcc«gc «.gcc«g 35100 
c^acaccg. gcccg«tgt gfcatgcag guaggagcg ggtcgtgan gccgacgatc 35160 
tcccgcacccgttcgg..tgacgoagU^gcga«a>atcuctggacagac.ggaatc 35220 

.gagcgggcc gacaagacta gcggccggaa ccgcaccCc atccagggco 35280 
acctgga^t cgtgatggac atcctggtg. tccac^otc ccgccaggat ggcccaa^ 35340 
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actgtatgcacaacaacgggcagtgtgggcagctgtgccttgccatccccggcggcc^^ 354OO 
gctgcggctg cgcctcacac tacaccctgg accccagcag ccgcaactgc agccgtaagt 35460 
gcctcatggt cccccgcacc tcactccctc gttagatcag gctggttctg ggagctgacg 35520 
ctgaaaggag cttctcatct ggggttcctg ggtgtacata gatggttggg taggttgtgc 35580 
actgcacaag ctgcatgatg ctacctgggg gtccaggtcc aggctggatg gacttgttgc 35640 
ttcatcagga catagataaa tggccaaaac tcctcagctg gaaggtcctg ggcaggatct 35700 
ttgggtgtga aaaccagtca caggggaagg gtgcttgctc atactgccag cacagtgctg 35760 
agtgctttcc atagcgctcg tttactcctc aagcctggag ggtggggagt agcatggtcc 35820 
catttcacgt acaaggaacc cgatgcacag agaggtgtgg caacccatcc aaggccatac 35880 

^ctggggtgggttgagccggggttgactgtggcaggctggctcaagagtccctgctcct 35940 
gaacccttgc caggcagcct ggcatcagct cggggaattt ttgccctgac ccttggaagc 36000 

aagtgggcctctttgttctcatgtcagtgatgagaagagtgactttcctatggcccctct 36060 
ggagtacagg tgtttcctgt tggcgggctc ttcccccatg acatcagcag cgagctggtt 36120 

atgattccctacgcagaacttgatagtttataaagctc^tgtcatccaggcccc^^^ 36180 
agtctcacgc agacctggtc gcaggcgggg ctggtcttgc ctgtcccagc tgcatggatg 36240 
gggaacttgaggcttgcaaaggttaaggggctgttcgaggcccacgctggc^^^^^^ 35300 
gcctgggccagagtctgggacttcccatgcctgggctgtcmggtcctgttg^^^^ 36360 
tccctccctg gggccatgac cttagagagc caaatggagg tgcaggtaac ccacggcaag 36420 
gaggggttgccatgactcagagtccccgtcctgtggccggcagtacctggtg^^^^^^ 36480 
tggatttcag accagccact gtagccx:gct gacggtgcgc tcgaagtgcc acagcttctg 36540 

aagccaggcaggactcaggccaggagactctgttagctgttgagagggagaggccaacgg 36600 
atgttctggttctgctagagagctggttcttcggatcctggtaccagtgcac^^^^^^^ 36660 
ggcccagcttgattctggggctgccttgtggtggcatgtgctgctcactgacaca^^^^ 36720 
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gccagggcac agco^toaaa atctccatca a.gaca.gta agaa^gaga ggaacctgg. 38160 
aaatagcaaa gtgcctmg ca.a«-t gg«ago« a.cccacaa. actgtgcan 38220 
cgtaaacgtt aatgctgoaa .aaa^ggc .«tcaccn gggaagatct ggagttggc. 38280 
.tgagtgtg gaagggtgta gogca«agt »g.gaaa cacggaagg aggangtgg 38340 
gaaa^caaat ggaaagttc. caccccaggc gtggagaaga g.ggg.ca.g gccccagcag 38400 
^agcccagg gagg^agag acggaggtg. gtgtgtgggt g.gacc.«c gcag..ccct 38460 
gccggcgm gmmgca «cgc«aa. gmocgtg gaggaaa«g «catgagca 38520 
aatgtgaaac cg.gCgtgc toaaattgtc c«auc«c attgca«gg aacagangg 38580 
«ttt«tt.««mtt«««mgaaatggagu*«ctctgtcaccagc 38640 
ctggagtgca gtggcatgat cttggctcac «caacc» gc«octa.g acaagtga. 38700 
„t cagcctcctg agtaactggg aUacagggc atgagccacc gcggccggcc 38760 
agamgcatt«gaaacaa«g«aggctggg.gcggtggc.cacacctgm^ca 38820 

g^agtggg aggccgaggc agg^gaua cctgagg.ca gggg«egag accagcCgg 38880 
c^ca.ggtgaaaccccg.c«ac.gaa.aucaaaaa«g<Stgg«gcgg 38940 

g^cctgm ^ccagcac .caggaggct gaggcaggag aattgcttga acccaggagg 39000 
cagaggag.gg«agcegaga.cacacca«gcac.ccagcctgggcaacaagagcaaa 39060 

„tc aaaaaataaa aaatagaaaa acaagtgctg tagcggaag. gagcacmg 39120 
cggagtcaggc«g.gtggc«gttc=acaaa«awtca.gg«gcc.caggc^ 3^l«> 

cctggagtct gcagcatggg gcacaa^gS «-«ag.g .agaa«cca ggacaggc« 39240 
ggotcctaagcagccttcttflacaaaaactgcagagcccgcotgutcgugcactttg 39300 

ggaggccgaa gtgggtggat cacgaggtca ggag«caag accagcctgg ccaacatggt 393«. 
g.aacc<xa.cU««aaa«aogaaaaOagagggtgtggtggcaogcgcctg.ag 39420 

tcccagcac .cgggaggc. gaggoagaa. «cttgaacc .gggag^ aggttgcagg 39480 
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gatctgagac catgtcattg cactccagcc tgggcaacag agcgagacgc catctcaaaa 39540 
aaaaaaaacc tacagagcca cacggcctct ttctccaccg agtgttggtg tgggagcttg 39600 
tgttattgtg gtgaaatctt ggtactttct tgaggcagag agaggctgag cgcctggaga 39660 
gactttcaca tgggtcgcca tgtccgccgt cggtttcgct gttgtgctcc ccatctgaag 39720 
gctggtgccg tccagacagg ctggacgccc ctttccacca gatccttcct cccgcagcag 39780 
tttctagtta cgttgfactg tgaggtctgt gtccttggtt gatggcaaaa gtcagccgaa 39840 
ttgaaattca gagccatgcc tggctccctg gagcttctct cctgggcagc tgtgatcatt 39900 
gcctctgctg tggtgtgggt ggtggaaatg gattcctttc atcttgcttg ctacaggtga 39960 
ctgtcacgtg gagtcctttg gagagaggga cgtgttaatt gatggatgtg gctcccatgc 40020 
tgagaaagctcctgggcgtacattgccttagagtttcattggagctgcgttcttttatgg 40080 
tgtctgctag gcagaagtga tgaagacttg gaagaaaacc cagaaggttt tccacttaat 40140 
ttggaaaatg tgcttttccc ctcctgtgtc ttttgctaag gtccagcctc ctgcagcctc 40200 
cccgctctgt ggactctggc tttgattctt tattaggagt ccccctgctc ccccaaaaga 40260 
tggtgtctaaattaicatccaattggccgaggttttgttttctattaattgtttttattt 40320 
tttattgtgg taaatttata taacataaaa tttgccattt taattgtttt gttattgttg 40380 
tttttgagac agggtctcac cccagtgccc aggctggagt gcagtggtgc gatcatggct 40440 
cactgcagcc tcagcctcca gggctccagt gatcctctca cctcagcctc tctagtagcc 40500 
gggactacaggcatacactaccacatctggctgattttttgtattttmmatt^^ 40560 
agacccgctatgttgcccaggaggtctcaactcctggactcaagccatcctcccacctc 40620 
accctcccaaagtgctgggattacaggcatgagccacaacacccagccattttaattttt 40680 
ttttttttttttgagatggagtctcactctatcgcccaggctggagtgca^^^^ 40740 
atcaactcac tgcaacctct gcctcccagg ttcaagcgac tctcctgcct cagcctcctc 40800 
ccgagtagctgggattacaggtgcccatcactatgcctggctaatttttgtattttttag 40860 
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ggagtgtctt ctctcgggct tgttgactgt gcccggtttt ccgcagttca ctggtgcaca 36780 
cataggcaca tagcaaaccg cacacacagt cgtgggtatg agtttcacta cattccacca 36840 
ccagtgttca ctaccattac ctgccttccg tcttaagtgt tcatcattta aaaataaatt 36900 
tattgggctg gacgcggtgg ctcatgactg ttatcccagc actttgggag gctgaggcgg 36960 
gcagatcacc tgaggtcagg agttcaagac cagcctggcc aatatggtga aactccatct 37020 
ctactaaaaa tacaaaatta gctgggcatg gtggggcatg cctataatcc cagctactca 37080 
ggaggctgag gcaggagaat ggcgtgaacc cgagaggcag agcttacagt gagcccagat 37140 
agcaccactg cagtccagcg tgggcaacag tgcgagactc catctcaaaa aaaaaataaa 37200 
taaataaaag aaaaataaat ttatgatcta tttcaaaaat aacacatgta ctttgaaaca 37260 
gcagagacac atatgacacg gagaatgaaa ttccccatag cgcaccccca agagacagcc 37320 
ctggtccccc cgtctttccc gtggacctcc agcggggcag atgctgagcc gcctgttgtc 37380 
gagtggcatg ctatcccgtc ctccagctcc tctgtggctt acagacaccc acctgcagcc 37440 
ctgtctttgc ctcctctagc gcccaccacc ttcttgctgt tcagccagaa atctgccatc 37500 
agtcggatga tcccggacga ccagcacagc ccggatctca tcctgcccct gcatggactg 37560 
aggaacgtca aagccatcga ctatgaccca ctggacaagt tcatctactg ggtggatggg 37620 
cgccagaaca tcaagcgagc caaggacgac gggacccagg caggtgccct gtgggaaggg 37680 
tgcggggtgt gcttcccaag gcgctcctct tgctggtttc caggctgctg cccctgtcct 37740 
tagcagaggg aggaaacaga ggatggctct gggtgaatga tgacttgggc ttcgattatg 37800 
tagtcacagg gtatgaccct gagatgcgtg gaaccccgag actgtgatta tatgtagaaa 37860 
ctgggtttcc ccgttgttta agtagtcatg gtggggtcag accccacagg acttttgtct 37920 
tttcaagaaa gaaaatggtc gtgtgtcatg caggggtagt tggtactggt taatccaggt 37980 
ttatccttta ttttgtggga actgtacagt catttctgct acaatgctgt atatgctctt 3 8040 
ctgaaagaca cctatgcaaa atcgcacagt aaaaatgaca caactcatag ggaaagcggg 38100 
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cagagacggg gtttcaccat gttggccagg ctggtcttga actcctaacc tggtgatccg 40920 
cccgcctcgg cctcccaaaa tgctgagatt acaggtgtga gccaccgtgc ccggcctttt 40980 
tttgtttttg agacagggtc ttgccctgtc acccagactg gagtgcaatg gtgggctctt 41040 
ggctcactgc agcctccgcc tcccaggctc aagttgtgca cctccacacc tggctaactg 41 100 
tattttatgt agagacagat ttcaccatgt tgcccaggct gggcttgaaa tggactcaag 41 160 
cagtccaccc acctcagcct cccaaagtgc tgagattaca ggcgcgagcc accgcaccca 41220 
gcccatttta cctattctgc agttgacagt tcagtggcat tcagtcagtt cacgaggtaa 41280 
ccatcactgc cattcatctc cagactactt caccttctcg gcagatgtcc gaaactgtcc 41340 
gcattgaaca cactcctcat ctccctctga cagccaccat tctactttgt atctctctct 41400 
gccttctcta ggtacctcat gtaagtggaa ttataccaat atttgccctt gtgtgactgg 41460 
cttctttcat gtgacatggt gtcctcaagg ttcatctgtg ttatagcctg tgtcagaatt 41520 
tccttcctta aagcctgaat aataacccgt tgtaaaggct gggcgcggtg gctcacaccc 41580 
tctaatccca gcattttggg agtccgaggt gggcagatca cttgaggtca ggagtttgag 41640 
accagcctgg ccaacatagt gaaaccctgg ctctactaaa agtacaaaat tagctgggtg 41700 
tggtggcgcg cacctgtaat cccagttact caggaggctg aggcaggaga atcgcttgta 41760 
cccgggaggc agaggttgca atgaaccaag attgtgcctc tgcagtccag cctgggtaac 41820 
agagtgagac ttcctgtctc aaaaaaaaaa aaaatcatcg gatggatgga cggaccactt 41880 
cttgttattt atccatccac gggtgctagg tttcttccac ctttggttgt cgtgaataag 41940 
gccactatga acatttcctt ccgtggtgaa ggttttgtac tagtgaggaa aaggcgtgtt 42000 
tgtggtgttg cataggattc tggtaagaaa gtttgcacta accataagta tttgtactac 42060 
attaaaatga aagctcaggg gccgggcgcg gtggctcacg cctgtaatcc cagcactttg 42120 
ggaggccagg gcgggcggat catgaggtca ggagatcaag accatcctgg ccaacatggt 42180 
gaaaccccgt ctctactaaa aataccaaaa aactagccag gtgtggtggc gggcacctgt 42240 
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ttcaagtgat cctcttgcct cagcctcctg agtagctggg attacaggtg cacgccacca 5 1960 
tacccagcta atttttgtat ttttagtaga gacggggttt caccatgttg gtcaggctgg 52020 
tctcgaactc ctgacctcgt gatccgccca cctcggcctc ccaaagtgct gggattatag 52080 
gcgtgagcca ctgtgcctgg ccattaggtg tgttttatca cccagcatca tgcagtttat 52140 
cttggtgaat gttctgtgta ctcttgaaaa gaatgtggat tctgctgttg ttgggtggag 52200 
tgttccagaa acatcaatta gatccagttg gttaatagtg ctcatcaggt tgtctctatc 52260 
cttccttcct gactgcctgc ttgagctgtc agttattgac aggggtgtgg agtctccaac 52320 
tctaatggtg gatttgttta tttctcctag tagttctatc tttttctctc cttctaccct 52380 
tgatcctctt ctccccctag ggcttcctgg tgttggtggt gggagagtgg ggtagtgaag 52440 
aacctggact ttagggccaa agaggccagg gttcaaatcc tggctctgtc acttcccagt 52500 
tgagtgaccc tggctggtgc ctgaatctct gtgagcctcc acttcctcct ctgtgaaatt 52560 
gagagcactt acctggcagg ctgtcatggg catcaagtaa cagggcactc cacctggacc 52620 
ctgacacgtg atgcacagga atgccagctg ctatgccatg ggtgtggcag tagtaataaa 52680 
gtgaccatct gtatcctcac cacagtgaag cctgtccagg gctttctctc ctatgccccc 52740 
atgcctccag gtggccttgg atcctgttgg ttctgtgctc tgctcagcga cctttctccc 52800 
gtgggagttc ctgggggttc agcttcatcc tacagacagc agcacacact ggctgtgcac 52860 

cctttttttt tttttttttt tttlttttga gatggagtct cgcttttttc gcgcaggctg 52920 
aagtgcagtg gtgtgatctt ggctcactgc aacctctacc tcctgggttc aagtgatttt 52980 
cctgcctcac cctcccaagt agctgggatt acaggctccc accaccacgc ccggctaatt 53040 
tttgtatttt cagtagagat ggtgtttcac catgttggcc aggatggtct tgaactcctg 53 100 
acctcaggtg atccgcccac ctcagcctcc caaagtgcag ggattacagg cgtgagccac 53160 
cacacccgga gtgccggttg tttttagcag tttgtcttgt tcctggagag actggctcct 53220 
gcccaggagc tcggggagta gggccgcggg gtgctgcctc acacctcgag tttggccgta 53280 
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attttgtgtg ccagcgcgtg gtgtgccagc gctatgcggg ggccaacggg cccttcccgc 60240 
acgagtatgt cagcgggacc ccgcacgtgc ccctcaattt catagccccg ggcggttccc 60300 
agcatggccc cttcacaggt aaggagcctg agatatggaa tgatctggag gaggcaggag 60360 
agtagtctgg gcagctttgg ggagtggagc agggatgtgc taccccaggc cctcttgcac 60420 
atgtggcaga cattgctaat cgatcacagc attcagcctt tcccactgag cctgtgcttg 60480 
gcatcagaat ccttcaacac agaggcctgc atggctgtag caacccaccc tttggcactg 60540 
taggtgtgga gaaagctcct tggacttgac cttcatattc tagtaggaca tgtgctgtgt 60600 
tgtccacaaa tcctcatgta ccctagaaat gaatgtgggg gcggctgggc tctctccaga 60660 
gctgaaggaa tcactctgta ccatacagca gctttgtctt gagtgcagct gggatttgtg 60720 
gctgagcagt tacaattcct acgtggccca ggcaccagga acgcaggctg tgtttgtaga 60780 . 
tggctgggca gccgcaccgc agagctgcac catgctggtt tgtatcacat gggtgaccat 60840 
ggtatgtcta agaaggtgga gtccctgtga ggtctgcagg tgcccccaca gctccaggcc 60900 
accttgagga ttgcctctgc ctgcccagcc ctgagttccc tctcccctgt cctgtcccac 60960 
tgtcacccca agccggcctc attgggagcc tgttggatgg cagggtatag atgtaacctg 61020 
attctctctg gggagcgggg ttatctggct tctcaagagc tcctaggagc ccacagtggt 61080 
ggcaccatca cagtcgcagc agcccccaga gaacgcggcc ctgtctgttc ctggcgtgct 61 140 
ctgtgctgcc ccgcctgggt tccctgcccc agtcgcaggc cccttggagg aggtaccatg 61200 
tgtctcccgt ttcacagatg agccccgggg agctcactct agtagtggcc agagaggcct 61260 
gcggctcagg gagcggggca catttccaac aggacacacc gccctggtct gagtctcgtg 61320 
ggtagtggga gcagaggaga gcgccctatg tctgtggggc ggcttggctg agcctggaag 61380 
ccacctgacc tcccccgtcc cttccctgcc aggcatcgca tgcggaaagt ccatgatgag 61440 
ctccgtgagc ctgatggggg gccggggcgg ggtgcccctc tacgaccgga accacgtcac 61500 
aggggcctcg tccagcagct cgtccagcac gaaggccacg ctgtacccgc cggtgagggg 61560 
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tggactcgag cgatcacct gcctcggcct ccccagg«c tggganaca ggcg«agco 63000 
acogtgcctg gccggggu «g.c«ctt atggcaccg ac^tggtgg gccctgggaa 63060 
ggaagtagca gaagagggtt ctcttggt. .cctggacag taa«gag.g Wggaggc 63120 
cccagggcotggcntgmagggacaaagggaacgguaccagaagccgagagttua 63180 

a^ccoactgcccttcttccctgccctgctgctgoaacccagct^accagccaggag. 63240 
gcaggaacc caagcagggc ccccgagcac acagcaggca gctcacgaa. .ctcmtcc 63300 
t^o^ ^mo^U gaggatcaa atcaggcaa. aagagatggc aCgagcagc 63360 
cagctaamttfaaatcacfflaagmaacca^gacteacccactuaaaaaggg 63420 
,acagacagtgggtMag.g»tcacaga.g.g.gcaaccc.caccacagttaa« 63480 

^gaacafl. Kctgcccct aaaagaaaa agcatgaag ccagctgttt ttaaattagc 63540 
aaagttam.gca«=ctttaaatatatgt.catgg.acaaaattcaaaaga.acagaag 63600 

agtcgcagt ccaaagagac .ccgccccca tgacgccaag caggactccc tgggaggcat 63660 
ggocccgc agtg««c ttctatgtcc ccccagggg. catctguca mgcaagca 63720 
ucaagagcg tggactttgl tttccaagcc agaagataat tgtagama tgtgcagttg 63780 
.gagaaagag cacagaccca tttatccc. gcctggmc ccccagtgct gcctgccatc 63840 
ttgcatp^^tttcattccuteataagcaagacactgataacgaucmoaccttattc 63900 

agattgacat aagtgttm tg«gn« tgagacaaac ttcc.ctg<c acccagtggg 63960 
apgoagtgg cacaatcaca g«cactgca gcc«=aaact cagggctca agcgattoo 64020 
cgcctcagt cccctcaagt ag«caga,g gcagg«.gc accatcatgc <=agg<=taa« 64080 
«aaamt ttgtggaggt gaggcctcac mamcc. gggcBgtct tgaactcc« 64140 
agcBaagtg atcotcctgc ccag^ caaagWa ggattacagg catgagccac 64200 
tgcgcctggg ogaeatatg tgtmcgta agcccgaaag atagcatag aagagtoaac 64260 
attgagccttgccttttgctgctaa£gatgta<aaaag«gctgt<ctgagcatttcgga 64320 
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gggggcttgg gctccaaact ctgactgtgt gtttgagtcc ggctgtggaa acctagccat 68520 
tgagatgccc cctcttggtg gctctgtcct cttaggatgg gacaagtctg tgaaggctgc 68580 
tgcagcaccc accgtagacc cctaatcgtg tgacgtcacc aggatggtcc gggctgctca 68640 
cttgccacag tggcctgttt gagcccggga agccaacggg gctgctcagc tggacaccag 68700 
ccccccgagc tgcccatgtt ggggtcacag gccccacctc cctggttggg gaggggcaac 68760 
tgagagtgtg gagaggtggg acccaggtgt gctggtctcc gcaggggctg gatcagagcc 68820 
tgggatgggc agggtgagcc tcctgacctt taacccagtg gtgtcaggca acgtggccca 68880 
cccgccagcc gcaccaggcc ccacccccgc aggtgaaggg gtgggatagg ctgggcctgg 68940 
gccaggacac ctctggacca cgcattcctc attgcttggg tccctggagc agcagggcct 69000 
cccgagtgtg gtgccgcctg ccacctagtg gccatttcca cgaactccca ggcctggctg 69060 
gggagccgga actgcagcct ccatttccac cccactccgg gtcgggccac ctccctgatg 69120 
cctcagtatt atatcaaact gtcacagtct gtcccacagc cttacagacc actgtctcca 69180 
gaatggtcac atccacactg ggcagcccag tctcgctagt tcctcgtccc acctcctgcc 69240 
tttgctcatg cccgtcctgc tctgggccca ccgcggacac atcttccccc cgcccgccgt 69300 
ctgacctcac agcagctggg ccccaagagg agtatcctgt cctgctgcac ttttctcaac . 69360 
acccggtgtt ggctgcacct tcccacccat tgcaggcccc tctgtgacag gacgggggct 69420 
cctaaacaca ccacagttcc gagtctgaac tcacacagtg ggatgcggcg tttctgggcc 69480 
acagttgggt gcaggtagcc tctgggagga tgggaggtca ggagccatct tgcgagtcag 69540 
gttgcttgaa ctoaggatgg aagtgttccg ggcccattgg ttgctgtatt agcctgttct 69600 
cacgctgcta ataaagacat acccaagact gggtaattgt aaaggaaaga ggtttaacgg 69660 
actcacagtt ccacctgcct ggggtggcct cacaatcatg gtagaagaca aggaggagca 69720 
agtcacatct tacatggctt cagggaacag acagcalgag aaccaagcga aaggggtttc 69780 
cxcttgtaaa accatcaagt ctagtgagat ttattcacta ccacgagaac agtatggggg 69840 
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ggctgctcag ctggacacca gccccccgag ctgcccatgt tggggtcaca ggccccacct 2940 
ccctggttgg ggaggggcaa ctgagagtgt ggagaggtgg gacccaggtg tgctggtctc 3000 
cgcaggggct ggatcagagc ctgggatggg cagggtgagc cicctgacct ttaacccagt 3060 
ggtgtcaggc aacgtggccc acccgccagc cgcaccaggc cccacccccg caggtgaagg 3 120 
ggtgggatag gctgggcctg ggccaggaca cctctggacc acgcattcct cattgcttgg 3 180 
gtccctggag cagcagggcc tcccgagtgt ggtgccgcct gccacctagt ggccatttcc 3240 
acgaactccc aggcctggct ggggagccgg aactgcagcc tccatttcca ccccactccg 3300 
ggtcgggcca cctccctgat gcctcagtat tatatcaaac tgtcacagtc tgtcccacag 3360 
ccttacagac cactgtctcc agaatggtca catccacact gggcagccca gtctcgctag 3420 
ttcctcgtcc cacctcctgc ctttgctcat gcccgtcctg ctctgggccc accgcggaca 3480 
catcttcccc ccgcccgccg tctgacctca cagcagctgg gcxccaagag gagtatcctg 3540 
tcctgctgca cttttctcaa cacccggtgt tggctgcacc ttcccaccca ttgcaggccc 3600 
ctctgtgaca ggacgggggc tcctaaacac accacagttc cgagtctgaa ctcacacagt 3660 
gggatgcggc gtttctgggc cacagttggg tgcaggtagc ctctgggagg atgggaggtc 3720 
aggagccatc ttgcgagtca ggttgcttga actcaggatg gaagtgttcc gggcccattg 3780 
gttgctgtat tagcctgttc tcacgctgct aataaagaca tacccaagac tgggtaattg 3840 
taaaggaaag aggtttaacg gactcacagt tccacctgcc tggggtggcc tcacaatcat 3900 

6 

ggtagaagac aaggaggagc aagtcacatc ttacatggct tcagggaaca gacagcatga 3960 . 
gaaccaagcg aaaggggttt ccccttgtaa aaccatcaag tctagtgaga tttattcact 4020 
accacgagaa cagtatgggg ggaaccaccc ccatgattca atcatctccc actgggtccc 4080 
tcccacagca cgtgggaatt atgggagtac aattcaagat gagaittggg tggggacaca 4140 
gccaaaccct atcggttgcc aacatttaca gtaacagtgt taggigaaca gttgtccagt 4200 
ctcctgtttt gtcggacact gtttctagca ccttccaggc agaatctcat gtatccttca 4260 
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ctttcgaaat gggtactatt tcatccccac ttttatcaat gagaaactaa agctcgaaga 4320 
ggtcaagtaa gttcctggcc aaggtcagct agcaggctct agaggcctcg ttctccttag 4380 
aggcagcctt gccagggccc aggcttggca ggctgcaggg caggtgcggg catgcccatg 4440 
gtagaggtgg gaccattgag gctcagagag ggtaagtgat gagccctggc gacacagcgg 4500 
ggtgggtcca gagtccggcc tgcatcttct ggagctggcc agtggacagg cctttcccgt 4560 
tcacagcccc ggggctgctg tgcccaccag ggcggatgtg cctaccgaat cccactcctc 4620 
tgtgtgtgtc cctttcaggc cctacatcat tcgaggaatg gcgcccccga cgacgccctg 4680 
cagcaccgac gtgtgtgaca gcgactacag cgccagccgc tggaaggcca gcaagtacta 4740 
cctggatttg aactcggact cagaccccta tccaccccca cccacgcccc acagccagta 4800 
cctgtcggcg gaggacagct gcccgccctc gcccgccacc gagaggagct acttccatct 4860 
cttcccgccc cctccgtccc cctgcacgga ctcatcctga cctcggccgg gccactctgg 4920 
cttctctgtg cccctgtaaa tagttttaaa tatgaacaaa gaaaaaaata tattttatga 4980 
tttaaaaaat aaatataatt gggattttaa aaacatgaga aatgtgaact gtgatggggt 5040 
gggcagggct gggagaactt tgtacagtgg agaaatattt aiaaacttaa ttttgtaaaa 5100 
cagaactgcc attctttcgt gccctgtgtg catttgagtt gtgtgtcccc gtggagggaa 5160 
tgccgacccc cggaccacca tgagagtcct cctgcacccg ggcgtccctc tgtccggctc 5220 
ctgcagggaa gggctggggc cttgggcaga ggtggatatc tcccctggga tgcatccctg 5280 
agctgcaggc cgggccggct ttatgtgcgt gtggcctgtg ccgtcagaaa gggccctggg 5340 
cttcatcacg ctgttgctgt tcgtcttcct cagattctta gtcttttttt tttttttttt 5400 
ttttttgaga cggagtcttt ctctgtcatc caggctggag tgcagiggta caatctcagc 5460 
tcactgcaag ctccgactcc caggttcaag tgagtctcct gcctcagcct cccgagtagc 5520 
tgggactaca ggtgcgcgcc accacacccg cccagctaat ttttgtattt ttagtagaga 5580 
tggggtttca ccatgttggc caggatgatc tcgatctctt gacctcgtga tccgcccacc 5640 
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aaaaaacaag acccaaccat ctcttgcata caagaaacac actttaccta taaaaacaca 7080 
ctaggccagg tgtggtggct cacacctgta atcccagccc tttgggaggc ctgactggca 7 140 
gatcacctga ggccaggagt ttcagaccag cttgaccgac atggcaaaac. cccatctctc 7200 
ctaaaaatac aaaaaaacaa aaaaaagaaa aaggctggaa gtagtgatgt gtgcctgtag 7260 
ccccagctac ttgggaggct gaggcaggag aattgcttga atccgggaag tggaggttgc 7320 
agtgagccag gatggtgcca ctgcactcca gcctgggtga cagagcgaga ccctgtcata 7380 
^aa:^aaaaaa gaaaagaaaa gaaaaacgag aaaaacaaac acaaaattag tagaagaaaa 7440 
gaaataataa agatcagaac aggccaggct catgggcaca gtggctcaac tcctacctgc 7500 
tcaggagttt gagaccagtc tggccaacat ggcaaaaccc catctctcct aaaaatatga 7560 
aaaaaaaaaa ataggctgga tgtggtgatg tgtgtgtgcc tgtagcccca gctacttggg 7620 
aggctgaggt gggagaatca cttgagccca ggaagtggag gctgcagcga gtcatgaatg 7680 
caccctgcac tctagctggg taactggagt gagattctgt ctcaaaaaag caaagaccag 7740 
agcagaaata aatgaaatgg aaatgaagga aacaatgcaa aatgatacaa aaagtttttt 7800 
cgaaaagata aacaaaatca acaaaccttt agccagatta agaaaaaaag agagaagacc 7860 
caaataaata aaatccgaga ttaaaaagga gacattacca ctgataccac agaaattcaa 7920 
aggatcatta gaggcaacta tgtgcaacta tatgctaatg aactggaaai cctagaagaa 7980 
ctgggtaaat ttctagacac atacaaccta tcaagattga accatgaaga aatccaaaac 8040 
ctgaacaggc cgggcacggt ggcttacgcc tgtaatccca gcactttgga aggcctgaga 8100 
tcaggagttc gagaccagcc tggccaacat ggtgaaaccc catctctact gaaaaaatat 8160 
aaaaattagc cgggcgtggt ggcgggtgcc tctaatgtca gccactcggg aggctgaggc 8220 
aggaaaatca cttgaacctg ggaggcatag gttgcagcga gccgaggttg caccactgca 8280 
ctccagcctt ggcgacagag ccagactcca tctcaziaaaa attaaaataa caaaaacctg 8340 
aacagaccaa taacaagtaa tgcgatgaaa actgtaataa aatgtttccc aacaaagaaa 8400 
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gcccaggaac aaatggcttc actgctgaat tttaccaaac attttttttt ttttgagacg 8460 
gagtctcgct ctgtcgccca ggctggagtg cagtggtgta acctcggttc gctggtaact 8520 
tatgcctctc aggctgcaag tgattttcct gcttcaggcc ccccgagtgg ctggaaatta 8580 
gatggtactt gtcaaacaag gcctggctaa atttctatat ftccttcaag tagaagatgt 8640 
gcttccaaca aaggttgggt tacggctggc ttctgaaaat cttggatttc aaggctcccc 8700 
aaaag 8705 

<210> 11 
<211> 66933 
<212> DNA 
<213> Homo sapiens 

<400> 11 

tataatcaag cgcgttccgt ccagtccggt gggaagattt tcgatatgct tcgtgatctg 60 
ctcaagaacg ttgatcttaa agggttcgag cctgatgtac gtattttgct taccaaatac 120 
agcaatagta atggctctca gtccccgtgg atggaggagc aaattcggga tgcctgggga 180 
agcatggttc taaaaaatgt tgtacgtgaa acggatgaag ttggtaaagg tcagatccgg 240 
atgagaactg tttttgaaca ggccattgat caacgctctt caactggtgc ctggagaaat 300 
gctctttcta tttgggaacc tgtctgcaat gaaattttcg atcgtctgat taaaccacgc 360 
tgggagatta gataatgaag cgtgcgcctg ttattccaaa acatacgctc aatactcaac 420 
cggttgaaga tacttcgtta tcgacaccag ctgccccgat ggtggattcg ttaattgcgc 480 
gcgtaggagt aatggctcgc ggtaatgcca ttactttgcc tgtatgtggt cgggatgtga 540 
agtttactct tgaagtgctc cggggtgata gtgttgagaa gacctctcgg gtatggtcag 600 
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taaataaaaa tttattaaaa cattcatcac agccagccta gtgggtgtcc catgtggctt 1 1700 • 
tgcctcgcat ttccctgata actaggatgc tgagcgtctt gtcccaggct tgccacacct 1 1760 
cagcactttg agatacgtcg cacagtcccc atttgcgaac gagaaatgag gtttagggaa 11820 
cagcagctgt gtcatgtcac acagcgagca gggggtctct gagccgtctg accccacagc 1 1880 
cgaccaagct ccaatcctta ccgcctccta gtgttgtgga tgtagcccag ggtgctccca 11940 
catttttcag atgagaacac cgaagctcaa aacaggagcg ttttgtccac attggataca 12000 
cgatgtctgt ggtttggtcc tgaagtcact ttatatctca gtggtccaga ctggagtagg 12060 
acagggggtt ctggggaatg gggaaggtgt ctcaggtgaa aggaaggaat tccagattct 12120 
ccatactgtc cttgggaagt tagaagactc agagggtctg gcaaagtcag acaaagcaag 12180 
agaaatgcag tcaggaggaa gcggagctgt ccaggaacag gggggtcgca ggagctcacc 12240 
cccaggaact acacttgctg gggccttcgt gtcacaatga cgtgagcact gcgtgttgat 12300 
tacccacttt tttttttttt ttgaggtgga gtctcgctct cttgcccagt ctggagtgca 12360 
gtggcacgat ctcggctcac tgcaagctct gcctcccggg ttcatgccat tctcctgcct 12420 
cagcctcccg cgtagctggg actacaggcg cctgccaccg cgcccggcta atttttgtat 12480 
ttttagtaga gatgggattt cactacatta gccaggatgg tctcgatctc ctgacctcat 12540 
gatccgcccg tctcggcctc ccaaagtgct gggattacag gcgtgagcca ccgcgcccgg 12600 
cccgatttcc cactttaaga atctgtctgt acatcctcaa agccctatac acagtgctgg 12660 
gttgctatag ggaatatgag gcttacaggc catggtgctg gacacacaga agggacggag 12720 
gtcaggaggt agaagggcgg agagagggaa caggcggagg tcacatcctt ggctttcaaa 12780 
atgggccagg gagagacacc ctctgagcat ggtaggacag gaaagcaaga ttggaacaca 12840 
ttgagagcaa ccgaggtggc tgggcgtggt ggcttacgcc tgtaatccca acactttgga 12900 
aagctgaggt gggtggattg cttgaggcca ggagttcaag accagcctgg ccaacatggt 12960 
gagaccccgt ctctactaaa tatacaaaaa ttagccaggc gtgatggtgc atacctgtaa 13020 
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tcccagctgc ttgggaggct gaggcaggag aattgcttaa acctgggagg cggaggttgc 13080 
agtgagccga gatcccgcca ctgcactcca gcctgggcca cagagtgaga ctccatctca 13140 
aaaaaaaaaa aaaaaaaaga taaaaagacc aaccgaggaa ttgaagtggg ggggcgtcac 13200 
agtagcagaa gggggatcgt ggagcaggcc accctgtggt catgcactgg aagctcatta 13260 
cctgacgatt tggagctcat cactgggggc ctaaggagaa tagatactga aggatgagga 13320 
gtgatggcgc ggggcacggg tgtctttggt ggccagaact tggggactgc tggggtgcct 13380 
cactgcaggc cttctcagcg ccctttatat gcttacacag gctgtttcta agagggggat 13440 
acattgcata agcgttttca gactacctca tcatgggtcc ctttctttac cctctgtggc 13500 
cctggtggcg cactctctgg gaaggtgcag gtggatgccc agacccgccc tgccatccac 13560 
ctgcacgtcc agagctgact tagcctcgag attgctgctg gcacctcctg ccccgggaca 13620 
cctcggatgt gcccgtggag atgctggctc tgtgttttct gctggagttt ggtgcgtctt 13680 
ttcctcctgc aagtggccac cgctcttggg tatgtcctca ggcttctgcg agtcatggct 13740 
gcttctcagg tccttgccca gcgccaggag caaaccctcc tggcactttg ttcaggggtg 13800 
gatgcgccag tgttcctgct gtggaccccc atctcacatg agggtcttgg gcctgcaggc 13860 
tcgttcagga aacacccgct gagtacgcag tgtgtgccag ctgtgtccca ggcaatggcg 13920 
gggacagtgg ctgctgctgg ggttgtggtg gcttctgggg actctgggga cagctgaggt 13980 
gcaaggagcc acggctcctt gaggatgcag ttggactcca ggtggaaggg atggttgggg 14040 
gaggtataaa tggggtcagg gaggagacac atttggaaca atgggaacat ttttaagatg 14100 
ctatgtcggg aggcaacaag gtggccaacc caggtgctga ggagcccaca ccagccctgg 14160 
acgtgttttg ccgctcacct ttgctgggga gtggtgggag agaggattcc gttccacgtg 14220 
gtggtgtgcg cagctgggct gtgtggagct gggcgctagg aggaaggtgc tttctgcggg 14280 
gctagccggg ctctgccttt gaacacaatc aggctccagg ttttcagcat ccagtgcatg 14340 
agaggacttc acgggcagct gtggctgatc ccttgatgaa ttgggagaag aacaaaggtc 14400 
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tatgaaa.gaggmcatgtagatggoattagagacgeccaca.cagattmoagag.gg 14460 
agcggagacg gcgga.ggg. «gggaggoc co.oo.go., gcCtgaotg .gaoagctgt 14520 
ecgggaatc agottocagg cogooooagc agoCgaCg acaoaoacag gggttttago 14580 
ccca.co.go gaooagcg. tgcoatcato agtgaoagot gggagtggcg g«g..ccag 14640 
eeo,gggoaoootococaco.gotggggoocacocagggoagtootgacaoc.aoaggtt 14700 

gottggagco goatocgagt cotgoocoao caogtgtgaa gcoogag.gg «g.gggctg 14760 
agg.ococ.gattgca.ooccact.ooc«o«ot.oaoa,agc.gco.ctte.oaoog. 14820 

mtocagcc tocgggou ggaaucoag .gttg^Cg go«gccco aggaoacctt 14880 
augoccte ttoc.gag.0 mgagooocg gggg..ggaa g»o.ggoco o.gggaoaoc 14940 
.gcagcoaoac.cagc««cc.g.gagoc.coagoawccco.caggaocaagccotc 15000 

aogaoago «ooocgooo acCgggCo agocagggga aggoCggct gggagogte. 15060 
«co.c«ooc.gcoot«:tccoc.ctaccctgoocttctotoctotgocccgooatggo 15120 

.gtgocaoaa gaca.ggc.g tgtgtgaaag tggoagggto tggoa«=« 15180 
g.gggtoto. gaggoocaog c«:oag.goc aconocoa ooogo«goo g.gocoKat 15240 
go.Egagggaoagcc^gcoo«ooogaaoocoagoccca.gtgoooagotgooocogg 15300 

,cc«oooc.ggaagoogggg«.o.ooagcog.atgocatgg«gggaca.cc«ctt 15360 

cottggoct. ocagggaagg .cCctttoo aaatggcgao aoctggtcoo tgccggagg 15420 
o.ggaagotgttgoco«g.atgcccoKoagggte.g.gcgo.cggttggccogagtto 15480 

ocatoaoog. oatoatoaoc atcatoattg toatttogc. tgK^tgtgag ooggoCggt 15540 
ccooagagc agagacooto «aggtocag octgagttgg ggtotocgtg otgaoocotg 15600 
aoggggaoK aggaog.aco aggtctggg. caggagtgac ccccaaacct cgtgc<«<t 15660 
gaoaggcaoc oagactttt gomgtggg WSPsao aKaottaoa gogggagtga 15720 
.gggacaggg K.g.tggo. gcaotgtgot oocagggato tggggagagg cu.a.coo. 15780 
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gggctttggc actgcagagc tgtgtgtgtt tgtgtgtgtg tgtgtgtgtg tgtgtgtgtg 15840 
tgtgtgtgtg tgtgtgtgtg tttgcgtgcg cgcacatgtg tataagatct ttttttatta 15900 
catgaagcaa gataactgtt gctgtttcct tttgggtttt gtgttcaaca gagtggggta 15960 
cttcttccct cagacaacag aactctcccc tttaaacacg tgctgtcaga gggtgggtct 16020 
tgggctcatg tctgtttgca cagccgagtc agaggaaaca cagggttctt cataaaaaca 16080 
ctgcacagca ggcgactgtc cagagtcagc ctgcaggacg gcagcagccc tgcccctcag 16140 
agcacagcta gggtgggctg ctttgggatc tcccgtcatt ccctcccagc tggcagccgg 16200 
cggccggccc attccttggt gtgctggtca ggggggcgtg cgcctgctct gctcaccctg 16260 
ggaatgggac agaagctggc agctcggaga ggacagggct ggacccttgg gtggcctctg 16320 
gctggaccat ctcattgtcc tcagacacag cctctcgggt ctagtttcat ttcctgaaaa 16380 
acaagtgcac agaactagag caggagtcga gagctacggc ccccgggcca gatccagccc 16440 
tgccacctgt tttcacacca tgctcaagct gagtgggttt tacatttttt aattacttga 16500 
aaaaaaaaaa gccaaaggag gtttcatgac ccatgaaaat tatatggaat tcaaaaaaaa 16560 
aaaattatat ggaattcaaa tttcagtgtc cataaataat ttcttgagac agggtctcgc 16620 
tctgtcacccaggctggagtgcagtgctatggcatggctcgctgtacccttgacctccca 16680 
ggctcaagcgatcctcctgtctcagcctectgagtagctgggactacgggtgtgtgccac 16740 
caagcccggctaatttttttttaattttagtaaagacagggtctttctatgttgcccagg 16800 
cttttctgga actccatctt ggcctcccaa agtgctggga ttacaggctc gagccacgga 16860 
gcccagcctg tttttgtttt ttcactgata aagttttgcc gggtgtggta gtgtgtgcct 16920 
ctagcgattt gggaggctga ggtgggagga tcgcttaagc ccaggagttt gaggctgggc 16980 
tcaagtgatc aggaggtgaa ctatgatcat gtcattgcat tccagcctgg gtgacagagc 17040 
aagaacctat ctcttaaaaa tatatattta aaaagtattg ggtgtggtgg ctcacgcctg 17100 
tggtcccagc tacttaggca tctgaggtgg gaggatggct tgagcccagg agtttgaggt 17160 



wo 01/92891 PCT/USOl/16946 

189 

tgcagcgagc caagatcgtg tcactacact ctagcctggg tgacagagcc cagaccctgc 17220 
ctctttaaaa aaaaaaacca aaaaacatgt attggaacac agccatgcct gttcagtcac 17280 
gtgctctcca tgctgctttc tgctccagag acccttatgg cctgaaagct gaaaatattt 17340 
tctatccttt acaaaaaagt ttgctgacct ctgtcctgga aaattcatct cccaagttct 17400 
cttccggcac tggcgttcct gggtgtccta aatttggccc ctgttatttc tgaactctgt 17460 
tttggctctg ttccctccca ggagccagga caggcacgtt ctctgcatct tgtcccctga 17520 
cgcccagagg cttggctcgg ctcaggcatt cttggaaata tctggctcca ggaaaggcag 17580 
aggcctcctg agtcagccca gagggaacct gc(xcaggtc tgggggaggc ctgacccagc 17640 
agagtggctt ttgccgatgg gttgggccgg tcaagatgtg ctgaaagttg tcctcagaag 17700 
gccactttgg gattccttcc tccagtatta gagcaactga gagctgctca ttgcaagcct 17760 
gatgttttcc cagttggccg ggtccaccgg gtgccctggg attctgggat ctgggtggaa 17820 
agtagggggc ttgggggagt gtcctgggtt ctggaatcca ggtggcaagt ggtgaggttc 17880 
agggagtggc ttctgagcca ccataggggt ctctgtggga ggctctgccc atccaggaga 17940 
ttccgcaggc cctgccggcc cagagccagc gtcttgcgct tgccgaggct acagccagcc 18000 
ccagccgggt ggaacagccc gtcgcctcct ctcactttgt tttggggcca cctgggagtg 18060 
tggagcaagg gtagagaggg aggaagtggc tgccggccgc tgcccagcac ccttgtttgc 18120 
cttgggccct ctgtgggctc ctttttattg ctcttcaatg aagccaggga aatggacttc 18180 
cttgcctcac ttcagttcaa catgtctgga agtttggtat taaaattaag aaagtgtgga 18240 
aatagagcaa gaagagaaaa atctctccaa gagataatag tgacctctga gctgggcgcg 18300 
gtggctcacg cctgtaaatc ccagtacttt gggaggctga ggcgggcaga tcacctgagg 18360 
tcgggagttt gtgaccggcc tgaccaagat ggagaaaccc cgtctctact aaaaataaat 18420 
aaataaataa ataaataaat acaaaattag ccaggcatgg tggcgcctgc ctataatccc 18480 
agctaaggca ggagaatcgc ttgaacctgg gaggcaaaggjtgcagtgag ccaagatcac 18540 
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gccattgcac tctagtctgg gcaacaagag tgaaactccg tctcaaaaaa aataaataaa 18600 
taaaaaataa aaatagtgac ctctggccag gtgtggcagc tcatacccgt aatcccagca 18660 
ctttggaagg aaggccgaga tgggcagatt gctttagcac aggagtttga gaccagcctg 18720 
gccaacatgg tggaacccca tctctacaaa aatagaataa aatttaagag gtaatagtga 18780 
ccttttggta gatcgaaacc tggattgctt tctttttcta aatgctgatt cttttctttg 18840 
tggtgtttgt gttctgtgcc gatgtcxxtc ccccagccct gttattgtga gtggaagaag 18900 
gggaaagggt tcgcccgcta ctgtgagccc ctcctctcac gctgggtgtc cttggagaag 18960 
cctgcacttc ttcattgtac gccagggctg ggtccctccc tggagtggtt ctgtgctgct 19020 
gggatggggc caacccctca gatgttttct gagtgtcaca cacaggtgtg tgcattcatg 19080 
gcctttgcgt gtcttcctgt tgtggaggca aaaatgtgaa gaaccctaga tgattttggg 19140 
accagggctc catcacctgc tgttcattgc acaccggagc atccaggcat gggtggagag 19200 
ctcagacttc caggcacggt cgcaggggct ggtctaacca tgttcccgcc cgcctgctcg 19260 
tcagaaccgc ctgttgggag ctgttatcat gataccatac ctgggccctg ggctatccga 19320 
ttctgactta attgctccag gttggggcca ggccgttgtt tgctgttttg ttgtttcttc 19380 
tgtgacgtta gccactgggc taatctgagc ccctcagtta caggtggaga aactgagacc 19440 
catgggggtg caaggacttg ccgaggaccc agagcccctt gggggcagag ctgaggcggg 19500 
gcctggcttt gggtcccaga gcttccagtc cccttcccgc tctcctaaca gctttttttt 19560 
ttgagacaag atctcaccct gtcacccagg ctggagtgca atggcatgat ctcggctcac 19620 

I 

tgcaatcttc gctagctgcg ttccagcgat tctcctgcct cagcctcccg agcagctggg 19680 
attacaggtg tgtgccgcca tgcccagctc gttttttttt gtacttttag tagagatagg 19740 
gtttcaccat gttggccagg ctgatctcga actcctgacc tcaaatgatc cgcctgcctc 19800 
ggcctcccaa agtgctagga ttacaggctg ggatcacact gtgcctggcc ctagcagctt 19860 
tgtcctgtgc catccaacaa cagatgaccg aagtctttgt ttcttaacat gcattccatc 19920 
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tgccttacag ttttgccacc tgcaaaacag aggacttgtc gcttttctgg taagctggaa 19980 
atgtaatctg gtagcaggag gcctgtggaa gcttgccttt aatggccttg tgtctctttc 20040 
atcctgtcct gagagccgga gaacttggat gttgcaccta actcaacctt cctgttaaca 20100 
tacagttctg caggctcatg gatcatcaga accacgtcct atctcacgcg gctgtatgct 20160 
tccgttggtt caggtgtttt taccttgaca gtattttctc ctcggtggct tttgcggtgg 20220 
ttgcttttaa tcagcattga ctcttcaaga aaaatattta gctgctacat ctcagaggag 20280 
acagggtgga aagcatctga gacctgcagg ctcagactta gaaccagaag tgccctcaga 20340 
gttcatccgg ccctgaccca gcgggaaatg agttcacaga gaagcgggag aactttgccc 20400 
caggccctgc cgttgctcat aactgcccca ggtccttaca tttgctccag gtcctgcccc 20460 
aggccctgca gttgctcata actgccccag gtccttatat ttgctccagg tcctgcccca 20520 
ggtcctgcag ttgctctgtg tggtgggtgt gatctggagc cctccgccca ttgctgcacc 20580 
tggggcaggc attgctaatt gatcccagga ctccttcctg cggagcacgc cctggttctc 20640 
caggcagccg ctgcctgtca gcctgcagtg gttcgggaga ggacacctgc ttgcctggtc 20700 
tgttccaaat cttgcttctc atcccagcac aggtaggggg tgctatggga aagggatcct 20760 
cagttggccc tgtcactgct ctatcagctg gggacgtggc atcctagtga aaacatcatg 20820 
gccgggcgcg gtggctcacg cctggaatcc cagcactttg ggaggctgag gagggtggat 20880 
cacttgaggt cagaagttcg agaccagcct ggtcaacatg gtgaaaccca tctctactaa 20940 
aaatacaaaa attcgccagg tgtggtggcg ggtacctgta atccgagcta ctcgggaggc 21000 
tgaggcagga gaatcgcttg aacctgggag gtggagcttg cagtgagccg agatcttgcc 21060 
actgcactcc agcctgggca acagagtgag acgctgtctc aaaatctcaa acaaacaaac 21 120 
aaacaaaaaa caaacaaaca aagcgtcatt tatccagcac ccctggggaa ccatgctacc 21 180 
tggtgtttta tggtacctgg caaggtgcag gtgaagttgc tgctcttggg cattgaaccc 21240 
gtcttgtttg gggcagctca ggccccaggc agggtccggg ttggctctcg ttggtgtggc 21300 
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cctggcccat ccagacctat atttctgccg tcctgcaggt gatcaatgtt gatgggacga 21360 
agaggcggac cctcctggag gacaagctcc cgcacatttt cgggttcacg ctgctggggg 21420 
acttcatcta ctggactgac tggcagcgcc gcagcatcga gcgggtgcac aaggtcaagg 21480 
ccagccggga cgtcatcatt gaccagctgc ccgacctgat ggggctcaaa gctgtgaatg 21540 
tggccaaggt cgtcggtgag tccggggggt cccaagccat ggctcagcca tgcagacttg 21600 
catgaggagg aagtgacggg tccatgcctg ggcataagtg ttgagctcag gtgccccgac 21660 
ctggggaagg gcaggacagg aaaggtgaca gtatctggcc aaggacagat gggaagggac 21720 
caagggagct gattagggag tggttatgga ctaggaatgt cggtaacaat ggttagaaag 21780 
tgactaacat ttgttgagca cctgctgtgt gcccggccct ggccgggagc cttcgtgccc 21840 
acagtgaccc cgtctgcaaa tgtagttoct tgccctactc gcactgggga gcaggacgca 21900 
gagccgtgca tctcacaggt gccaagctca ggactccctc ctgggtctgc ctgggctggg 21960 
ctgtgcttgt tgcccctgtg gcccacgcat gtgcaccttc cacctgaaag ccaggatett 22020 
caggacgctc cccgaggagg tcgttgtctg gcacaatgat ttgtctcttc ctgaaaaggt 22080 
gacagagtta cactggagag agcagcatcc aggtgcggca gggacaggcc tggggctcgc 22140 
gggcagggac tctgtgtcct gccggggtcc cacactgcac ctgcttgtca gaggcactca 22200 
gtcaatcttt gctgatgaag gatgagagga cagaggacgt gatgcttgct gctgcattgc 22260 
ctgcagtcct gggtgagatg cccgggttga ctctgctgcc cgtcgggtgg atgtgatgtc 22320 
agatccccgg ctttaaaata cgagggagct gggaattgag ggagcaggtt ggggcagaaa 22380 
gcacagcccc gtggaagcct ggagctgagg cagtgtgggc gacccctgga gcagtgagtg 22440 
cttccttcat ggccttcatc gcaccctgca gtcctcatgt aggggatgcc atccatgaat 22500 
ttagttttcc cagcctcctt taaaaacgcg ttcatgctgg ggccggggca gtgcagtggc 22560 
tcacatctga aatcccacca ctttgggagg ccgaggcggg tggatcatga ggtcaggaga 22620 
tcgagaccat cctggctaac aaggtgaaac cccgtctcta ctaaaaatac aaaaaattag 22680 
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ccgggtgcgg tggcgggcgc ctgtagtccc agctactcgg gaggctgagg caggagaatg 22740 
gcgtgaaccc gggaagcgga gcttgcagtg agccgagatt gcgccactgc agtccgcagt 22800 
ccggcctggg cgacagagcg agactccgtc tcaaaaaaaa aaaaaaaagt acaaaaaaaa 22860 
aaaaattagt ctgggtgtgg tatcacgcgc ctataatctc actactcgag aggctgaggc 22920 
ggagaattgc ttgaacccag gaggtagagg ttgtagtgag cccgtatcgt accactgccc 22980 
tccacctggg caatagagcg agactctgtc tcaaaaagaa aaaaaaaaaa agaacattta 23040 
tgccaggtgt ggtggctcat gcctgaaatc ccagaacttt ggaagactga ggcaggagga 23 100 
tcacttgagc ccagaaattt gagagtgtct tccctgggca acatagagag acctcatctc 23160 
taccagaaaa aaaaaaatta gcccggcatg gtggcatatc cctgtggtcc cagctactta 23220 
gggggctgac gtggcaggat cacctgagtc tggaggcaga ggttgaagtg agctgagatc 23280 
atgccactgc actccagcct gggtgacaga cagagaccct gtctcaaaaa aaaaaaaaaa 23340 
aaaaagcatt tactatccac catggaaggt gagactgacc tgtgagtgat tgttcaaaga 23400 
acaaaaaata aaccccagag ataagacaaa agggtgcctc catgggggtg tgatttaaag 23460 
ctgagaaatt gggcttcttc cccctcccct ctcaccccgt ggtttgctaa aggagatggg 23520 
aaaaaggatt ctttttttgg ctgaaatatt taacactaaa ttaaagccaa ttttaacagc 23580 
actttggttg atgagtgaaa ttaacagact ggccaiaaaat aaacgaacgg tctgtactat 23640 
gtgaaaaaga ggcagctttg gccatgctgg gccaatgtga gttttcaggg ttgctgggaa 23700 
tgtctgtgaa tcggaggaag ggcctagctg ggactctcag gagccaaggc cctgaggggc 23760 
aacttgcctg gtccctgccc tgaggcgttc actgctttct tcctgggcca gatcacaggc 23820 
ccggaggctg gaccactggg ctggcactct tgccgagctg ctccctgact tcctgaccat 23880 
gctcctttca gcagccttgc tgcactttag tttccttgaa tgaaaaatgg ggatgagaat 23940 
agctcctacc tccaaggtga atggagtgag ttcggacagg tgactccctg ggaccagtgc 24000 
ctggcgcctg acaaggtcca gtcagagccc gcactgctgt tactgatacc cttggctgta 24060 



wo 01/92891 

194 

ccaggggaga acttggttgc cattgccagg tgttctccca ccacccccac tactgtccct 24120 
gtttgatgtg tggcgggaat aaagctgtgc acattggagc ttttggcaca tcctggcttt 24180 
caggtgaaag gtgcgtgtgt gtttgagggt ttagcctggc caacccagcc atgaggtcgg 24240 
acctgacctg ggggtgagtc ctgagctcgg cacccctgag ctgtgtggct cacggcagca 24300 
ttcattgtgt ggcttggccg cacccctttc cctgctgggc tgttgatgtt tagactggag 24360 
cctctgtgtt cgcttccagg aaccaacccg tgtgcggaca ggaacggggg gtgcagccac 24420 
ctgtgcttct tcacacccca cgcaacccgg tgtggctgcc ccatcggcct ggagctgctg 24480 
agtgacatga agacctgcat cgtgcctgag gccttcttgg tcttcaccag cagagccgcc 24540 
atccacagga tctccctcga gaccaataac aacgacgtgg ccatcccgct cacgggcgtc 24600 
aaggaggcct cagccctgga ctttgatgtg tccaacaacc acatctactg gacagacgtc 24660 
agcctgaagg tagcgtgggc cagaacgtgc acacaggcag cctttatggg aaaaccttgc 24720 
ctctgttcct gcctcaaagg cttcagacac ttttcttaaa gcactatcgt atttattgta 24780 
acgcagttca agctaatcaa atatgagcaa gcctatttaa aaaaa5.aaaa gatgattata 24840 
atgagcaagt ccggtagaca cacataaggg cttttgtgaa atgcttgtgt gaatgtgaaa 24900 
tatttgttgt ccgttgagct tgacttcaga caccccaccc actcccttgt cggtgcccgt 24960 
ttgctcagca gactctttct tcatttatag tgcaaatgta aacatccagg acaaatacag 25020 
gaagactttt mam tttgagacag agtcttactc tgttgcccag gctggagtac 25080 
cgtagcgtga gctcagctca ctgcaacctc cgcctcccag gttcaagcga ttcttctgcc 25140 
tcagcctcct gagtagctgg gactacagac atgcaccacc acacccagct aatttttttt 25200 
atatttttag tagagacagg gtttcatcat gttggccagg ctggtcttga actcctgacc 25260 
tcaggtgatc tgcccgcctc ggcctcccaa agtgctgaga taacaggtgt gagccaccgt 25320 
tcccggcata ggaaaacttt ttgccttcta aagaagagtt tagcaaacta gtctgtgggc 25380 
tggccttctgattctgtaaagaaagtttgattggtggctgggtgcggtggctcacacctg 25440 
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taatcccatc actttgggag gccgacgtgg gcatatcacc tgatgtcggg acttcgagac 25500 
cagcctcacc aacgtggaga aaccccgtct ctactaaaaa tacaaaaaaa aaattaaccg 25560 
ggcatggcgg cgcctgcctg taatcgcagc tactcaggag gctgaagcag gagaattgct 25620 
tgaacctggg aggcggaggt tgtggtgagc tgagatggca ccattgcact ccagcctggg 25680 
caacaaaagt gaaactccgt ctcagaaaaa aaaaagtttg attggtgtaa ccaaagcgca 25740 
tttgtttatg gattgtctgt ggcagctttt gttctgccga gatgagttgt gacagatctg 25800 
tatgggctct aaagcctaaa acatgtgcca tccgcccctt tacagaaaaa gtgtgctgac 25860 
ctctgttcta aagtattgga caactacaat gtttgctcat ttattattct atgatttgtt 25920 
ttctgctttt tgttgttgtt gttgttgttg agatagggtt tccctctgtc actcaggctg 25980 
gagtgcagtg gtgtaatctc agctcactgc agcctcgacc tcctgggctc tagtgatcct 26040 
ctcatctcag cctccctagt agctgggact acaggcacac accaccactc ctggctgatt 26100 
LUimiLL lllLULLLl ttgtggagac agggtttccg catgttgccc aggctggttt 26160 
caaactccta ggctcaaaca cccacctcag cctcccaaag tgctgggatt acaggcgtga 26220 
gccaccatgc ccagcctalt ctactgtttg tattacatag ctttaaaaga ttttttatga 26280 
ctttaagtca caagggttct ttgtagaaaa aaatatatat ataggaaagt ataaaaagaa 26340 
agtaaaaatt gtccataacc tctccagcca gagacgaccg ttgctgacac ctcagcatat 26400 

f 

tgcctttaag tcttttttct ctaagatagc atttctcttc atcacagtca tatgctacgc 26460 
agaattctgt atcctgattt tttcacttga cattacaaca ggtatttgat ggcgctgtga 26520 
caaactcttt ggcacaatct tttaaatgta tgaaatactc cactgcacag atgtttgctt 26580 
ttaggcttaa ctgttctttt attttgcgtg tgctggttac agccgggcac agtggctcat 26640 
gcctgtaatc acaacacttt gagagggtga ggcaggagga tcacttgagc ccagaagttt 2670O 
gagaccggcc tgggcaacat agtgagaccc catctctaca aaaaactttt ttaataagtc 26760 
gggcgtagtg gtgcatagct gtagtcccag ccaccaagga ggctgagttg ggaggattgc 26820 
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ttgagcccca ggaggttgat gctgcagtga cctgagatta ctccactgta ctccaacctg 26880 
agcgacagag caagacttgt ctggggaaaa aaaaaaaaaa aatatatata tatatatata 26940 
tatatataca tatatacata cacgcacaca cacataatat aaaaatatat atttataaat 27000 
atataatata taatataaaa atatatattt ataaataaaa tttataaatt atatttataa 27060 
gtaaatatat aatatataat ataaaaatat atattatata atatataata aaatatataa 27120 
tataaaaata tatatttata aataatatat aatacatact tataagtata tatttaaaat 27180 
atatgtaatg tatatttttt aatgtatgat atataatata catttataaa tacacattta 27240 
tattatttta tataaaatat atataaaatc tccaagttgc tttttccaaa aaggtgtctt 27300 
gctgcatttc aaacattcat ttaaaaactt gaatgctggt gatctggtcc agaatgtgtt 27360 
cagtagctgc tgccagtggc caagcatctc gggagatgtc tacaaaacac gctggttctg 27420 
gcctggcgtg gtggctcacg cctgtaatct cagcactttg ggaggctgag gcaggtggat 27480 
caactgaggt ctggatttcg agaccagcct tgccagcttg gtgaaacccc atctctacta 27540 
agaatacaaa aaaattagcc aggcgtggtg gcatgtgcct gtaatcccac ctacttggga 27600 
ggctaaggct ggagaatcgc ttgaacccag ggggcagagg ttgcagtgag ccgagatcgc 27660 
accattgcac tccaggctgg gcaagaagag cgaaactccg tctcaaaaaa aaaaaaaaag 27720 
atgctggttc ctaaaatgtg gcccttttcc tcctcacctg ctgccagacc atcagccgcg 27780 
ccttcatgaa cgggagctcg gtggagcacg tggtggagtt tggccttgac taccccgagg 27840 
gcatggccgt tgactggatg ggcaagaacc tctactgggc cgacactggg accaacagaa 27900 
tcgaagtggc gcggctggac gggcagttcc ggcaagtcct cgtgtggagg gacttggaca 27960 
acccgaggtc gctggcxctg gatcccacca aggggtaagt gtttgcctgt cccgtgcgtc 28020 
cttgtgttca cctcgtatga gacagtgcgg gggtgccaac tgggcaaggt ggcaggctgt 28080 
ccgtgtggcc ctcagtgatt agagctgtac tgatgtcatt agccttgatg gtggccagga 28140 
ctggtagggc cctcagaggt catggagttc cttcgtggag cgggtgctga ggctgtatca 28200 
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ggcacagtgc tggctgcttt cacctgggcc gtctcaccga agtgtccatg gagcctgcgt 28260 
agggtgggta tctgtgtcga ttttacagat gcagaaacag gctcagagaa accgagtgac 28320 
ttccctaagg tcacataccc agttagagca gagctgggcc aggaagtgct gtctcaggct 28380 
cctgaccagg tctccttgct ttgcactctt gccaaaacca tgatccagaa ctgactttga 28440 
ggtccccgga cctcaggctc ctccgaaatg gcctcttgga ggctgctgag ccacagctta 28500 
ggacccacct cgagaggcaa atgtgctttg agctgccagg cgtcctgggg gccctgcctt 28560 
gggcacgggg ttcagacagg ccccagatgt gtggggcgtc tttctggact tgagttttct 28620 
tttctgtgtg gtggacacag tgctcacccc ttaaagcacc tgtgatgtgt gcagcagccc 28680 
aatccctgcc tgtcgcctgt tctgctaggg aaggaaggaa gacttcagga tggcaggaca 28740 
acagaaagag gtccaggttt tagagcaagg gcaggtcaaa cttagaaaat tctggaatga 28800 
ggatgtgcat ttcctcttct ggatctgcta aaagaagagg gaaggagggg ctgctggggg 28860 
aggagcccag agccgagttt acatccggat cccgcaaggc ctcccctgcc ctgaggtctt 28920 
gttttgtgat gtgcttgtgt ccatcctggt ttctgccgtg tccccaacat ccggccaagc 28980 
ttaggtggat gttccagcac acactcaccc tgtctgtgca cctgtttttg tgtccgtaag 29040 
tgggtattta ctcaccttac gagtgagcca ctgtgggaat tcagggaggt ggcgcagtga 29100 
ccacccctgg agggatatgt gtgtggcagg ggtcgagggt ctcgcccttc cctgcttcct 29160 
gcgcgtggct ttctccagga cggggagggc tgagctgaag aggtggggac agttgcgtcc 29220 
ccccgccacc cactgtcctg cggtgagagc agactcactg agcctgccct tctcccttgt 29280 
gccttccagc tacatctact ggaccgagtg gggcggcaag ccgaggatcg tgcgggcctt 29340 
catggacggg accaactgca tgacgctggt ggacaaggtg ggccgggcca acgacctcac 29400 
cattgactac gctgaccagc gcctctactg gaccgacctg gacaccaaca tgatcgagtc 29460 
gtccaacatg ctgggtgagg gccgggctgg ggccttctgg tcatggaggg cggggcagcc 29520 
gggcgttggc cacctcccag cctcgccgca cgtaccctgt ggcctgcaag ttccccaacc 29580 
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tggcaggagc tgtggccaca cccacgactg cccagcagcc tcaccctctg ctgtgggagt 29640 

tgtccccgtccacccctgggtgcctttgctgcagttatgtcgggagaggctctggtgaca 29700 
gctgtttcct gtgcacctgc tgggcactag gtcccagcta atccctgtgc caggactcta 29760 
atttcaccct aacacacatg gtggttttca ttgctgggga agctgaggcc tgagcacatg 29820 
acttgcctta ggtcacatag ctggtgagtt caggatcccc cagagatacc agggccagca 29880 
ctcgatcccc acccagccct gaaccccacc atgtgctggg attgtgctgg gagtgtccac 29940 
acgcctggga ccccagggct ggtgctctca tctccttttt ccagatcatg agaatgaggc 30000 
tcagggaagt ttgaaaaaaa cctatcccaa gtcacacagc aacaggagca ggatttgaac 30060 
ccagaaaagg ggaccgcaca ctctgttctg ctagagtagt tagctgtcct gggtgatatg 30120 
gcaggtgaca ggggcaactg tgcttaacaa aggaaccccc atoccccctg ccaagttggg 30180 
agactagaag gtcaggggca gaagctctga agggccaggt gcagtggctg ac^cctctaa 30240 
toccagcact ttgtgaggcc aaggcgggca gatgatttga gcccaggagt tcaagatcag 30300 
cctgggtaat gtagtgagac gccatctcta caaaaaaatt ttttaaaaat tagctgggca 30360 
tggtggttca tgcctgtagt ccaagctact tgggaggctc aggtgggagg attgcttgag 30420 
cccaggaggt tgaggttgtg gtgagctgtg atcatgccac tgcactccag cctgggcaat 30480 
agagtgagac cgtctccaaa aaaaaaaaaa gaagaagaaa aagaagctct gaggctccaa 30540 
gtccccaggc accccttggc ttgagggcag acaagggagg agagggtcac ctgggcagcc 30600 
ctgacttttg tcccctggca aagggacctt cagtgacctt ggccctagga gagcctctga 30660 
gcacgtcagc catgtcgaac cgctcaggaa gggcagcaag aatttggctt ctgacctctg 30720 
cctctcctac tcgccatctg cactgggtgt ggttgtgccc attttacaga tgaggaggct 30780 
ggggcatcga ccagctgaat gccttgtccc aggtactgcg taggcagagc tggcagttga 30840 
accccgtgtc ctggttgtcg ctgggggtgg gctgcaccct gacttgtgag gccagtagca 30900 
aggtttgcac gtgacttcgt gaccgtcacc cagctctgca gcacatcccg tgacccagct 30960 
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tggcaggagc tgtggccaca cccacgactg cccagcagcc tcaccctctg ctgtgggagt 29640 
tgtccxcgtc cacccctggg tgcctttgct gcagttatgt cgggagaggc tctggtgaca 29700 
gctgtttcct gtgcacctgc tgggcactag gtcccagcta atccctgtgc caggactcta 29760 
atttcaccct aacacacatg gtggttttca ttgctgggga agctgaggcc tgagcacatg 29820 
acttgcctta ggtcacatag ctggtgagtt caggatcccc cagagatacc agggccagca 29880 
ctcgatcccc acccagccct gaaccccacc atgtgctggg attgtgctgg gagtgtccac 29940 
acgcctggga ccccagggct ggtgctctca tctccttttt ccagatcatg agaatgaggc 30000 
tcagggaagt ttgaaaaaaa cctatcccaa gtcacacagc aacaggagca ggatttgaac 30060 
ccagaaaagg ggaccgcaca ctctgttctg ctagagtagt tagctgtcct gggtgatatg 30120 
gcaggtgaca ggggcaactg tgcttaacaa aggaaccccc atcccccctg ccaagttggg 30180 
agactagaag gtcaggggca gaagctctga agggccaggt gcagtggctg acacctctaa 30240 
tcccagcact ttgtgaggcc aaggcgggca gatgatttga gcxcaggagt tcaagatcag 30300 
cctgggtaat gtagtgagac gccatctcta caaaaaaatt ttttaaaaat tagctgggca 30360 
tggtggttca tgcctgtagt ccaagctact tgggaggctc aggtgggagg attgcttgag 30420 
cccaggaggt tgaggttgtg gtgagctgtg atcatgccac tgcactccag cctgggcaat 30480 
agagtgagac cgtctccaaa aaaaaaaaaa gaagaagaaa aagaagctct gaggctccaa 30540 
gtccccaggc accccttggc ttgagggcag acaagggagg agagggtcac ctgggcagcc 30600 
ctgacttttg tcccctggca aagggacctt cagtgacctt ggccctagga gagcctctga 30660 
gcacgtcagc catgtcgaac cgctcaggaa gggcagcaag aatttggctt ctgacctctg 30720 
cctctcctac tcgccatctg cactgggtgt ggttgtgccc attttacaga tgaggaggct 30780 
ggggcatcga ccagctgaat gccttgtccc aggtactgcg taggcagagc tggcagttga 30840 
accccgtgtc ctggttgtcg ctgggggtgg gctgcaccct gacttgtgag gccagtagca 30900 
aggtttgcac gtgacttcgt gaccgtcacc cagctctgca gcacatcccg tgacccagct 30960 
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catccaggcc gcatgcaaac ctgttgccag gcgagaaacc agtcacxgca cagctgtggt 3 1020 
tgcctgaaat gattaagctc attaatcacc ccggagtgag gacagactca gatgaaaacc 3 1080 
agcaaaagcc ctggaaactc atgtgaccct gccaatgagg gcggccatgt gcattgcagc 3 1 140 
ctggccgtca ctcctcggta cgtgttttgg acttaaacgc tccggatgtt tactgagtgc 3 1200 
ttgattaata acatggaagg cctggtctca ttgctgtggg agtgaaggat gcacagccag 3 1260 
gcctgacatg atgagaacaa gaacctggag tctcgctgcc tgggtggtaa tcctggccct 3 1320 
gccacttagc aactgtgtga ctgtagccag gtcacttaat tttgctagat cctgcctgcg 31380 
cttcagtgga tcttgctggt tttccaaggt ggccaaacac tttaaggcat tcatgtggtc 31440 
gctaggctgc agggttgaac cctggctcac cccgcagggc gccgtgtgct ctgtggcctg 3 1500 
gctgtgcctt tgctgacacc gtgcccgtgt gtgttcatgc aggtcaggag cgggtcgtga 3 1560 " 
ttgccgacga tctcccgcac ccgttcggtc tgacgcagta cagcgattat atctactgga 3 1620 
cagactggaa tctgcacagc attgagcggg ccgacaagac tagcggccgg aaccgcaccc 3 1680 
tcatccaggg ccacctggac ttcgtgatgg acatcctggt gttccactcc tcccgccagg 31740 
atggcctcaa tgactgtatg cacaacaacg ggcagtgtgg gcagctgtgc cttgccatcc 31800 
ccggcggcca ccgctgcggc tgcgcctcac actacaccct ggaccccagc agccgcaact 3 1860 
gcagccgtaa gtgcctcatg gtcccccgca cctcactccc tcgttagatc aggctggttc 3 1920 
tgggagctga cgctgaaagg agcttctcat ctggggttcc tgggtgtaca tagatggttg 3 1980 
ggtaggttgt gcactgcaca agc^catga tgctacctgg gggtccaggt ccaggctgga 32040 
tggacttgtt gcttcatcag gacatagata aatggccaaa actcctcagc tggaaggtcc 32100 
tgggcaggat ctttgggtgt gaaaaccagt cacaggggaa gggtgcttgc tcatactgcc 32160 
agcacagtgc tgagtgcttt ccatagcgct cgtttactcc tcaagcctgg agggtgggga 32220 
gtagcatggt cccatttcac gtacaaggaa cccgatgcac agagaggtgt ggcaacccat 32280 
ccaaggccatacaactggggtgggttgagccggggttgactgtggcaggctggctcaaga 32340 
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gtccctgctc ctgaaccctt gccaggcagc ctggcatcag ctcggggaat ttttgccctg 32400 
acccttggaa gcaagtgggc ctctttgttc tcatgtcagt gatgagaaga gtgactttcc 32460 
tatggcccct ctggagtaca ggtgtttcct gttggcgggc tcttccccca tgacatcagc 32520 
agcgagctgg ttatgattcc ctacgcagaa cttgatagtt tataaagctc tttgtcatcc 32580 

aggceeegtt ggagtctcac gcagacctgg tcgcaggcgg ggctggtctt gcctgtccca 32640 

gctgcatgga tggggaactt gaggcttgca aaggttaagg ggctgttcga ggcccaggct 32700 
ggcaggagat gggcctgggc cagagtctgg gacttcccat gcctgggctg tctttggtcc 32760 
tgttgctcac catccctccc tggggccatg accttagaga gccaaatgga ggtgcaggta 32820 
acccacggca aggaggggtt gccatgactc agagtccccg tcctgtggcc ggcagtacct 32880 
ggtgcaacga cttggatttc agaccagcca ctgtagcccg ctgacggtgc gctcgaagtg 32940 
ccacagcttctgaagccaggcaggactcaggccaggagactctgttagctgttgagaggg 33000 
agaggcxaac ggatgttctg gttctgctag agagctggtt cttcggatcc tggtaccagt 33060 
gcactgagag gaggcccagc ttgattctgg ggctgccttg tggtggcatg tgctgctcac 33120 
tgacaccctc gaggagtgtc ttctctcggg cttgttgact gtgcccggtt ttccgcagtt 33 180 
cactggtgca cacataggca catagcaaac cgcacacaca gtcgtgggta tgagtttcac 33240 
tacattccac caccagtgtt cactaccatt acctgccttc cgtcttaagt gttcatcatt 33300 
taaaaataaa tttattgggc tggacgcggt ggctcatgac tgttatccca gcactttggg 33360 
aggctgaggc gggcagatca cctgaggtca ggagttcaag accagcctgg ccaatatggt 33420 
gaaactccat ctctactaaa aatacaaaat tagctgggca tggtggggca tgcctataat 33480 
cccagctact caggaggctg aggcaggaga atggcgtgaa cccgagaggc agagcttaca 33540 

gtgagcccag atagcaccac tgcagtccag cgtgggcaac agtgcgagac tccatctcaa 33600 
aaaaaaaata aataaataaa agaaaaataa atttatgatc tatttcaaaa ataacacatg 33660 
tactttgaaa cagcagagac acatatgaca cggagaatga ^ttccceat agcgcaeccc 33720 
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caagagacag ccctggtccc cccgtctttc ccgtggacct ccagcggggc agatgctgag 33780 
ccgcctgttg tcgagtggcg tgctatcccg tcctccagct cctctgtggc ttacagacac 33840 
ccacctgcag ccctgtcttt gcctcctcta gcgcccacca ccttcttgct gttcagccag 33900 
aaatctgcca tcagtcggat gatcccggac gaccagcaca gccxggatct catcctgccc 33960 
ctgcatggac tgaggaacgt caaagccatc gactatgacc cactggacaa gttcatctac 34020 
tgggtggatg ggcgccagaa catcaagcga gccaaggacg acgggaccca ggcaggtgcc 34080 
ctgtgggaag ggtgcggggt gtgcttccca aggcgctcct cttgctggtt tccaggctgc 34140 
tgcccctgtc cttagcagag ggaggaaaca gaggatggct ctgggtgaat gatgacttgg 34200 
gcttcgatta tgtagtcaca gggtatgacc ctgagatgcg tggaaccccg agactgtgat 34260 
tatatgtaga aactgggttt ccccgttgtt taagtagtca tggtggggtc agaccccaca 34320 
ggacttttgt cttttcaaga aagaaaatgg tcgtgtgtca tgcaggggta gttggtactg 34380 
gttaatccag gtttatcctt tattttgtgg gaactgtaca gtcatttctg ctacaatgct 34440 
gtatatgctc ttctgaaaga cacctatgca aaatcgcaca gtaaaaatga cacaactcat 34500 
agggaaagcg gggccagggc acagccctca aaatctccat caatgacatg taagaaaaga 34560 
gaggaacctg ggaaatagca aagtgccttt tgcacattaa atggttagct atatcccaca 34620 
atactgtgca ttcgtaaacg ttaatgctgc aataaatacg gcacttcacc ttgggaagat 34680 
ctggagttgg cttatgagtg tggaagggtg tagcgcatga gtttttgtga aacactggaa 34740 
ggaggattgt gggaaatcaa atggaaagtt ctcaccccag gcgtggagaa gagtgggtca 34800 
tggccccagc agtgagccca gggaggtcag agacggaggt gtgtgtgtgg gtgtgaccct 34860 
gcgcagttcc ctgccggctg tagttttttg cattcgctta atgtttctcg tggaggaaat 34920 
tgtgcatgag caaatgtgaa accgtgctgt gctcaaattg tcctaataca tcattgcatt 34980 
ggaacagatt ggcttttttt ILLllLilLl tttttttttt tttttgagat ggagtctcac 35040 
tctgtcacca gcctggagtg cagtggcatg atcttggctc actgcaacct ttgcctccta 35100 
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tgttcaagtg attttcctgc ctcagcctcc tgagtaactg ggattacagg catgagccac 35160 
cgcggccggc cagatttgca tttttgaaac aactgctagg ctgggcgcgg tggctcacac 35220 
ctgtaatccc agcactgtgg gaggccgagg caggtggatc acctgaggtc aggggttcga 35280 
gaccagcctg gccaacatgg tgaaaccccg tctctactga atatacaaaa atcagctggg 35340 
tgtggtggcg ggtgcctgta atcccagcta ctcaggaggc tgaggcagga gaattgcttg 35400 
aacccaggag gcagaggttg cggtgagccg agatcacacc attgcactcc agcctgggca 35460 
acaagagcaa aactccatct caaaaaataa aaaatagaaa aacaagtgct gtagcggaag 35520 
tgagcacttt gcggagtcag gcttgtgtgg cctgttccac aaatgatgtg ctcacggtgg 35580 

cctcaggccc acctggagtc tgcagcatgg ggcacaacag gttcattagt gtagaattcc 35640 
aggacaggcctggctcctaagcagccttcttttacaaaaactgcagagcccgcctgtatc 35700 
ctagcacttt gggaggccga agtgggtgga tcacgaggtc aggagttcaa gaccagcctg 35760 
gccaacatgg tgaaacccca tctctactaa atatacgaaa attagctggg tgtggtggca 35820 
cgcgcctgta gtcccagcta ctcgggaggc tgaggcagaa ttgcttgaac ctgggaggtg 35880 
gaggttgcag ggatctgaga ccatgtcatt gcactccagc ctgggcaaca gagcgagacg 35940 
ccatctcaaa aaaaaaaaac ctacagagcc acacggcctc tttctccacc gagtgttggt 36000 
gtgggagctt gtgttattgt ggtgaaatct tggtactttc ttgaggcaga gagaggctga 36060 
gcgcctggag agactttcac atgggtcgcc atgtccgccg tcggtttcgc tgttgtgctc 36120 
cccatctgaa ggctggtgcc gtccagacag gctggacgcc cctttccacc agatccttcc 36180 
tcccgcagca gtttctagtt acgttgtact gtgaggtctg tgtccttggt tgatggcaaa 36240 
agtcagccga attgaaattc agagccatgc ctggctccct ggagcttctc tcctgggcag 36300 
ctgtgatcat tgcctctgct gtggtgtggg tggtggaaat ggattccttt catcttgctt 36360 
gctacaggtg actgtcacgt ggagtccttt ggagagaggg acgtgttaat tgatggatgt 36420 
ggctcccatg ctgagaaagc tcctgggcgt acattgcctt agagtttcat tggagetgeg 36480 
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ttcttttatg gtgtctgcta ggcagaagtg atgaagactt ggaagaaaac ccagaaggtt 36540 
ttccacttaa tttggaaaat gtgcttttcc cctcctgtgt cttttgctaa ggtccagcct 36600 
cctgcagcxt ccccgctctg tggactctgg ctttgattct ttattaggag tccccctgct 36660 
cccccaaaag atggtgtcta aattatcatc caattggccg aggttttgtt ttctattaat 36720 
tgtttttatt ttttattgtg gtaaatttat ataacataaa atttgccatt ttaattgttt 36780 
tgttattgtt gtttttgaga cagggtctca ccccagtgcc caggctggag tgcagtggtg 36840 
cgatcatggc tcactgcagc ctcagcctcc agggctccag tgatcctctc acctcagcct 36900 
ctctagtagc cgggactaca ggcatacact accacatctg gctgattttt tgtatttttt 36960 
ttttattgta gagacccgct atgttgccca ggctggtctc aactcctgga ctcaagccat 37020 
cctcccacct caccctccca aagtgctggg attacaggca tgagccacaa cacccagcca 37080 
ttttaatttt tttttttttt tttgagatgg agtctcactc tatcgcccag gctggagtgc 37140 
2^gtggcgtgg tatcaactca ctgcaacctc tgcctcccag gttcaagcga ctctcctgcc 37200 
tcagcctcct cccgagtagc tgggattaca ggtgcccatc actatgcctg gctaattttt 37260 
gtatttttta gcagagacgg ggtttcacca tgttggccag gctggtcttg aactcctaac 37320 
ctggtgatcc gcccgcctcg gcctcccaaa atgctgagat tacaggtgtg agccaccgtg 37380 
cccggccttt LLLLgULLL gagacagggt cttgccctgt cacccagact ggagtgcaat 37440 
ggtgggctct tggctcactg cagcctccgc ctcccaggct caagttgtgc acctccacac 37500 
ctggctaact gtattttatg tagagacaga tttcaccatg ttgcccaggc tgggcttgaa 37560 
atggactcaa gcagtccacc cacctcagcc tcccaaagtg ctgagattac aggcgcgagc 37620 
caccgcaccc agcccatttt acctattctg cagttgacag ttcagtggca ttcagtcagt 37680 
tcacgaggta accatcactg ccattcatct ccagactact tcaccttctc ggcagatgtc 37740 
cgaaactgtc cgcattgaac acactcctca tctccctctg acagccacca ttctactttg 37800 
tatctctctc tgccttctct aggtacctca tgtaagtgga attataccaa tatttgccct 37860 
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tgtgtgactg gcttctttca tgtgacatgg tgtcctcaag gttcatctgt gttatagcct 37920 
gtgtcagaat ttccttcctt aaagcctgaa taataacccg ttgtaaaggc tgggcgcggt 37980 
ggctcacacc ctctaatccc agcattttgg gagtccgagg tgggcagatc acttgaggtc 38040 
aggagtttga gaccagcctg gccaacatag tgaaaccctg gctctactaa aagtacaaaa 38 100 
ttagctgggt gtggtggcgc gcacctgtaa tcccagttac tcaggaggct gaggcaggag 38160 
aatcgcttgt acccgggagg cagaggttgc agtgaaccaa gattgtgcct ctgcagtcca 38220 
gcctgggtaa cagagtgaga cttcctgtct caaaaaaaaa aaaaatcatc ggatggatgg 38280 
acggaccact tcttgttatt tatccatcca cgggtgctag gtttcttcca cctttggttg 38340 
tcgtgaataa ggccactatg aacatttcct tccgtggtga aggttttgta ctagtgagga 38400 
aaaggcgtgt ttgtggtgtt gcataggatt ctggtaagaa agtttgcact aaccataagt 38460 
atttgtacta cattaaaatg aaagctcagg ggccgggcgc ggtggctcac gcctgtaatc 38520 
ccagcacttt gggaggccag ggcgggcgga tcatgaggtc aggagatcaa gaccatcctg 38580 
gccaacatgg tgaaaccccg tctctactaa aaataccaaa aaactagcca ggtgtggtgg 38640 
cgggcacctg tagtcccagc tacttgggag gctgaggcag gagaatggcg tgaacccggg 38700 
aggcggagct tgcggtgagc cgagatcgct tcactgcact cgagcctggg caacagagca 38760 
agactccgtc tcacgcaaaa ctctgtctca cgcaagactc cgtctcaaaa aaaaaaagag 38820 
ttcagggttt atgaaactgg ccagccgcgt aaagtttgct gtgttgtttt tgtgcccggg 38880 
aggagtgtgg ccagggtgtc acgtcacaca gtacacgttt ctcagatggt ggttctccag 38940 
actgctgtcc caaagtctgt ttttgcatct ggttcccaca gacccaccct ccacggtgag 39000 
cctgattttg gccagggtag ctggaatctt gcttgtcttt cagcccggca gctgtaccag 39060 
tccagggtcc acagctagtg gcttttagga aggaatttgt tcagttggct ttgacacatg 39120 
gccccctagg gtccacagct ctgtagtgat gtggatgttg ttatctacaa agacacatga 39180 
tccttcgtgt ccagatgaaa gtgatgatgt ctttgcagct gcccagcaag gctgtgtgtg 39240 
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tgtgtgtgtg tgtgtgtgtg tgtgtgtgtg tggtgtgtgt gtggtgtgtg tgtgtgtatg 39300 
ggggagggag gcaccctttc catctggggg tgtgtgtgtg tggggtgtgt gtgtgtgtgt 39360 
gcgcgtgtgt gtggtgtgtg gtgtgtgtgt gtgtatgggg gaggcaccct ttccatctgg 39420 
gtccaagaga ctgggcctgg ggaagacgct tctttttatc tacttagaga ctttgtttta 39480 
tttgtatttt tttgagacag ggtctcactc tgtcacccag gctggggtat ggtgatatga 39540 
gcatagctca ctgcagcctc ggcctcccag gctgaagcga tcctcccacc tcagccttct 39600 
gaatagctgg gactgtaggc gtgcgtcacc atactgagct attgtttttt ttgtttggtt 39660 
ggtttaattt tttttgatac agatggagtc ttgctatgtt gcccagacta gtctcaaact 39720 
cctgaactca agtgattctc ccacctcagt ttcccgacat tctgggatca caggtgtgag 39780 
ccactgctgt ctccctgttt tattaactgc tgaaagacct agataaagaa agtctgaaaa 39840 
gacttactat cagagcacca tcctaagatg attccctctg actcaatgga gagggagggg 39900 
agcttttcct tcaggcctgg gtggcaggag cccaggtgct ccaggcccca tttgcccxag 39960 
gccaaatcac tcgggaactt ggatgcagct gtctttcagg gtaacccaaa ggaaccagat 40020 
ccccgcaggc agtaggcttc tgggctgtcc tctcctccta cgtcagctca gtaagagccc 40080 
ttcgaaggga tgctgtgtcg gaggccccaa aagcccaggc tcatccctga gatgcacagg 40140 
gtgggctggg cttaggcagc gctcgagcat ctcctggacg gtgaccccag agagtgtgga 40200 
gacggagagt ccttgagagt cactgagaga cgtggctgcc ctgccttccc aagaggggct . 40260 
ctgagtcatt ccccacactc acctgcxcct acccaccctc acctggcccc cagcctcacc 40320 
tacccccaca tctgtaccga tccctttacc cgcaccttcc ctacccaccc tcacctcccc 40380 
tgtaccttca cctcccccac tcacccgccc ctgcaccctc acctgtcccc caccttcacc 40440 
taacccccac cctcacctgc cctcccctca cctggcctcc ttccgttggg gaaggggttg 40500 
taaggggcgg cccccaaact gtctgtcctg gtgccctgca gagaaaacag tacgtgaggg 40560 
ccgcagtcca aaagcttgag tcctggaagg tggaggagac agggatgtgt tgggaagggc 40620 
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cccatggtct tggatccctt ctcgactgtc aatggggcct tcatgggagc gccagtctag 40680 
tgatgcacag ctgggtgccc ggcgggtggc tgaggaggcc taaagtccga ggcggcaaga 40740 
gctcttccag aggctgttgt cctaatcgct ctggcatact caggcgggca cgtagttagg 40800 
agctgattgg agaggagaga cccccacacc aatactggga tttgactttc aggctaaact 40860 
tgagaagtgt ggcctctgct gtcctgccag agctctccag ccagtgccca gggctctcca 40920 
gccagtgccc gggggtctcc accagtgccc gggggtctcc gccagtgcca ggggtctccg 40980 
ccagtgccca ggggtctccg ccagtgctca ggagtcttgg tttctttgtc ttacagccct 41040 
ttgttttgac ctctctgagc caaggccaaa acccagacag gcagccccac gacctcagca 41100 
tcgacatcta cagccggaca ctgttctgga cgtgcgaggc caccaatacc atcaacgtcc 41 160 
acaggctgag cggggaagcc atgggggtgg tgctgcgtgg ggaccgcgac aagcccaggg 41220 
ccatcgtcgt caacgcggag cgagggtagg aggccaacgg gtgggtgggg gtgctgcccg 41280 
tccaggcgtg cccgccgtgt cttatgccga atgccagcct ctcacaggct ggggagactt 41340 
tccacctggg gatccaatgg gtggctttcc agggtcccaa aagcaaacac aggtttttca 41400 
cagcccgtcc gggaaagcag aaagccccaa ggggctggaa ggggaaaggg ggagctctgc 41460 
tgagaggtta caaggcagcg ctggccgacg ggagttgcag ttgataggtt ttgtatcatc 41520 
cttgttaaac ttgaaccctg tgcagaaatc ccttccacgg catgggggct gcctgttgac 41580 
tcgctcctgt tccaccacag ggagctcctg ggcttcttcc tcccagaggc ccccgacgct 41640 
cccacctgtt ggtcgtcaga gcttctggtt ggtgggaagg cacccaggac cttgaggtct 41700 
ccagagagaa aagccaggga aagagggaga ccgaaaccca tgtgacatga aactcaggct 41760 
ccaaactgag cacgggaacg tttggggaca ggagcgcgat ggccttcctc agatagctgg 41 820 
ggggctggca tgaagacggg agctacagcc agcacaggtc ctgggccggg agcccagaga 41880 
ttgagccctg actctgtcac ttactggcca cgtgaccttg ggcgggtggc atagcctctt 41940 
ggagactcag tttcctcatt ggtaggagtg acggccacag tggtgcggcc tctgcagcac 42000 
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acggggggct cggtgggcgg aagccccggg tctataaggc ggctgtgcag gagccagccg 42060 
agctggtctc ccaacagcca gggctccggg gtccttagca gctgtggggg gcctgcacct 42120 
gtttcccatg gctgctgtca gaaattacca gaagccaggt ggctgagagt aatggacact 42180 
tgttctctca cagttcctga gggctgaagc ccgagatcga ggtgtgggca gggccctgcg 42240 
ccctctgaag gctctgaggg aacctttggg cttctggtgg ctccaggcac cccttgactt 42300 
gtggtcctgt cactccagtc tctctgtctg gctgcacatg gcgtggcctc ttctgtacca 42360 
ttgaaggaca cttcagttgg atttagggcc taccctcacc cattgtggtc gtatcttgat 42420 
ccttcatgac atttgtaaag accctgcttc caaataagct cacattctga ggttctgggg 42480 
tgagcgggaa tttggagagc attgttcaac tagtatagaa tgtgacctgt cagcctcggg 42540 
cagccctgag aggcaggggc tttccacagc ccagctgggt gccctgggct ccgtgctgtc 42600 
cgaggagacg ccatccccac acccgtcctt cacccgccac cctcccgcag gtacctgtac 42660 
ttcaccaaca tgcaggaccg ggcagccaag atcgaacgcg cagccctgga cggcaccgag 42720 
cgcgaggtcc tcttcaccac cggcctcatc cgccctgtgg ccctggtggt agacaacaca 42780 
ctgggcaagc tgttctgggt ggacgcggac ctgaagcgca ttgagagctg tgacctgtca 42840 
ggtacgcgcc ccggggcctg ccctaaccgc agacacccgg ccttcattgt cagtaatggc 42900 
agcagctgcc acattgtccg agacctgccg tgagcccagt gccgcgccag gggctttgtg 42960 
tgtagcgtgt tttgtcctca cactgacagc tgtaggctgg ggttctgagt gagccccaca 43020 
gggcagaggc agaaaatgag tctcagagag ggtgagcgag ctgcttgggg ccccacagca 43080 
ggagatggag caggactgca gcctagcctc tgcccccagc acctgcgcaa gaagctgctc 43 140 
tgctctggac tgtgttaggc tgcgagggct ggagagaaat gagagttggt gcttagagag 43200 
ggggcgcagg tccccatggc ttttcctctt atgatgaggt agatgggtga agggaggggc 43260 
catgcttgca ggggccagtg accgaggccc gccgttggaa ctgatggcct tcatcccgag 43320 
cccagcccag gtgggagcag ggctttccga gggcttgtct tgggtcggcc tgcttccagg 43380 
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gactctgctg cagctcccac ccctgtccaa agcatggaat cccccaggct ccctggcagt 43440 
cctgtcaacc tctgtcctcc caagctgagt gtggggcaag ttctggaggt cagcactgct 43500 
caggggggcc cacgggctgc ttgcaggggc caaccgcctg accctggagg acgccaacat 43560 
cgtgcagcct ctgggcctga ccatccttgg caagcatctc tactggatcg accgccagca 43620 
gcagatgatc gagcgtgtgg agaagaccac cggggacaag cggactcgca tccagggccg 43680 
tgtcgcccac ctcactggca tccatgcagt ggaggaagtc agcctggagg agttctgtac 43740 
gtgggggctg gcagtggggt gggcagggtg gcctctaaac ccgacccctg gaggaggctg 43800 
gaggccagtg caagatcctg tgtggcctca gccaggcggt ggtctctgcc agatgccaac 43860 
tgttgcccgc tggggttcag cgacatgtcc gaatgtcccg aggcctctga ggttgttttc 43920 
ttttgccgca gaacaaatca ccacgaacag cgttttaaga caacaccaac tctttttttt 43980 
tttttttttt tgagtcagga tcttgctctg ttgcccaggc tggggtgccc tggtgcaaac 44040 
acagttcact gcagcctcga cctctgggct taattaagtg aacaccttgc ctcagcctcc 44100 
caggtagctg ggactacagg tgggcaccac cacacctggc taattttttt ttgtagagac 44160 
ggggtttccc catgttgccc aggctggtct gcaactcctg ggcacaagct atctgcctgc 44220 
tgtggcctcc caaagtgcta ggattatagg tgtgagccac tggcctgaca acacccacgg 44280 
attgtctctc agttctgtaa ggcaaagtcc aggcacagcg tggctcacct gggttctctg 44340 
ctcagggtct cacggggcca gaatcaaggt gtcaggaacg ctgggccctc agcggaggct 44400 
ctgtggagaa attagcttcc ttgctcactc agcaggtagc agttgtggga tcgaggttct 44460 
gttttctctc tggttattgg tcggggacca ctotcagctc ctagaggcca ccacaggtcc 44520 
ttgccccgtg gccctctctg cctcagcagt gggggctccc tgcgtcagtc cctcccacac 44580 
cttgagtctc tctgatttgc ttctaaaggg ccctgtgatt cggctcagcc acctttagat 44640 
taggttagcc tcccctttga tagactccaa gtcggctgat taataacctt aatcacatct 44700 
gcagaatccc ttctgccaca taaggtcatg acgccgtgct ggggactggg gtgggaaatt 44760 
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acggggtcat ttaggattct gcctgccact gccttgctgt gtcccagggc ttgggggagg 44820 
ggcctccaca gctgggacca cagtccttcc tcccctccat ggtaaccatc tgaggattac 44880 
ttgagaccag cctgggcaac atggtgagaa cccatcccta caaaaaatac aaacaaaaag 44940 
ggaccaggct gggcttggtg gctcatgcct ataatcccag cactttggga gaccaaggtg 45000 
ggctgatcac ttgaggttgg gagttcgaga ccagcctgcc caacatagtg aaatcccgtc 45060 
tctactaaaa atacaaaaat tagctgggtg tggtggcagg cgcctgtatt cccagctact 45120 
ggggaggctg aggtgggaga attacttgaa cctgggaggc ggaagttgca gtgagccaaa 45 1 80 
attacgccac tgcactccag cctaggcaat agagtgagac tccgtctcaa aaaaaaaaaa 45240 
gggccagggg tggtagtgac aaagagaccc tatcccaaaa aaaccgaaca ctgaatcctt 45300 
gagactgagt aaggacactg tgaaattttt ctgggtgggg cagggaacag agcgtcttct 45360 
gtcatttctt ccacctgggt gtggtcagct ctccctx:caa gctgcctcct cttcttctca 45420 
ttgtccgggt gttggacaca tttggttaac tggatagaat aacgcgagtt cccagggact 45480 
tggtcxattt gctattttat tttattttat tttattttat tttatttatt tatttattta 45540 
tttatttatt tattgagatg gagtttcgtt tttgtcgccc aggctggagt gcagtggcgc 45600 
gatctcggtt cactgcaacc tctgcctccc aggttcaagt gattctccta cctcagcctt . 45660 
ccaagtaact gggattacag gcacccacca ccataccagg ctaatttttt tgtattttta 45720 
gtagagacgg gttttcgcca ttttgcccag gctggtcttc aactcctagc ctcaggtgat 45780 
ccacgcacct cggcctccca aagtgctggg attacaggca tgagccacca cgcctggcac 45840 
catttgctat tttaattccc atgtgtatta gtgtcccacg gctgctgtaa caaatgacca 45900 
caaactggat ggcttaaagc aacagaaatg gattccccca atgtgctgga gaccagaagc 45960 
ctgcgaccaa actgttggga gggctgtgct tcctctgggg gctccaggga ggatctattt 46020 
gttggccctt ccagtgctgt gggtgccagc gttccacact tgtggatgcg ccgcctcaac 46080 
ctctgcccat cttcatgtgt ccatctcctt tgtgtctgcg tctttacctc ttcttcttgt 46140 
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ctgtgttgcc tcttataagg acgtttgtca ttgggtttag ggcccaccca aatcatccga 46200 
gatgacctcg tcttgagatc cttaacctgc aaagaccctt tttccaaaaa aaggttatgc 46260 
tcacagattc taggccttaa gacatgggtg tatctttctg gggggcacta tccaacccct 46320 
tatacaatgaaagacgggaagagggccaggtgtggtagttcacgcctgtaatctcagcac 46380 

tttaggaagc tgaagcggga ggatcacttg agcccaggag tttacaagta gctaggcaac 46440 
atgatgagac cccatttcta caaaaagtga aaaaaaaaaa aaaaaaaaaa aagccaggtg 46500 
tggtggctca cacctgtaat cccagcactt tgggaggctg aggcaggcag atcacgaggt 46560 
caggagattg agaccatcct ggctaacacg gtgaaacccc gtctctacta aaaatacaaa 46620 
aaattatggc cgggcgcagt ggctcccgcc tgtaatccca gcactttggg aggccgaggt 46680 
gggtgaatta caaggtcaag agatcgagac catcttggct aacacggtga aaccccatca 46740 
agatcacaag gtcaagagat ggagaccatc ctggctaaca cggtgaaacc ccgtctctac 46800 
taaaaatacaaaaaattagccgggcatggtagcgggcgcctgtagtcccagctgctcggg 46860 
aggctgaggc aggagaatgg cgtgaacccg ggaggcggag cttgcggtga gccgagatcg 46920 
ctccatgcca ctgcactcca gcctgggtga cagagtgaga ctccgtctca aaaaaaaaaa 46980 
aaaaaaaaaa aaaaaaagaa aattagccag gcacagtggc aggtgcctat tgtcccagct 47040 
acttgggagg ctaaggcagg agaatggcat gaacccggga ggtggagttt gcagtgagcc 47100 
gagatcatgc cactgcgctc cagcctgggc gatagagcaa gactctgtct caaaaaaaaa 47160 
agccaggcat ggtggtgcat gcctgtagtc ccagctactc aagaggctga ggcaggaggg 47220 
ttgttcgacc cacggagatc aaggctacag tgagccatga tcgcaccact gccctccagc 47280 
ctgggtgaca gagtgtgacc ctgtctcaaa gtaagtaaat aggaggagag acaagtgggc 47340 
agttcagact gatggtatgg gcacagtaga gactggtgca gacaggctgg cctgtgatgt 47400 
caagcaactt ctgtaattgt ttccggcatc catttgtgtg tcaatttccg tgtcagtagg 47460 
aagactctgt aggctgccaa gaggaataag tgggaggatc ctcccagaga ggccgggcct 47520 
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gcaggagggc cagttctcat gagttctcat ttggccccta ccctccaggc tgtggttctg 47580 
aggtgggaga cagagcctga cctctgtttg tcttgttttg tctttgcagc agcccaccca 47640 
tgtgcccgtg acaatggtgg ctgctcccac atctgtattg ccaagggtga tgggacacca 47700 
cggtgctcat gcccagtcca cctcgtgctc ctgcagaacc tgctgacctg tggaggtagg 47760 
tgtgacctag gtgctccttt ggggtgatgg acaggtacct gattctctgc ctgctaggct 47820 
gctgcctggc atccttttaa aatcacagtc cctgtggcat ccagtttcca aagctgattg 47880 
tgtcttcctt tgccctcctt tcttttctac tatgtgcatt cggtgctatg aattttcctc 47940 
taagtactgc gtttcctgca tctcacaaat tttgttacat tttcattttc aggtagtttg 48000 
aatattttta cacttctcct gagatgacat ctttggctca tgtgttattt agaagtgttg 48060 
cttagtttct aaagagttgg ggcttttcca gctgtdtctc tgcaactgat ttctaattta 48120 
attctactgt agtctgagag cttattttat atgatttctg ttattttaaa tgtgttgggt 48180 
gtggtgtttt tgttgttatt gtttttgtgt ctttttgttt tgttttgctt cgtttgtttt 48240 
gtttttgaga cagtgtcttg ctctgtcact caggctggag tgcaatggcg cgatctcagc 48300 
tcaccgcaac ctctgcctcc cgggttcaag tgatcctctt gcctcagcct cctgagtagc 48360 
tgggattaca ggtgcacgcc accataccca gctaattttt gtatttttag tagagacggg 48420 
gtttcaccat gttggtcagg ctggtctcga actcctgacc fcgtgatccg cccacctcgg 48480 
cctcccaaag tgctgggatt ataggcgtga gccactgtgc ctggccatta ggtgtgtttt 48540 
atcacccagc atcatgcagt ttatcttggt gaatgttctg tgtactcttg aaaagaatgt 48600 
ggattctgct gttgttgggt ggagtgttcc agaaacatca attagatcca gttggttaat 48660 
agtgctcatc aggttgtctc tatccttcct tcctgactgc ctgcttgagc ^tcagttat 48720 
tgacaggggt gtggagtctc caactctaat ggtggatttg tttatttctc ctagtagttc 48780 
tatctttttc tctccttcta cccttgatcc tcttctcccc ctagggcttc ctggtgttag 48840 
tggtgggaga gtggggtagt gaagaacctg gactttaggg ccaaagaggc cagggttcaa 48900 
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atcctggctc tgtcacttcc cagttgagtg accctggctg gtgcctgaat ctctgtgagc 48960 
ctccacttcc tcctctgtga aattgagagc acttacctgg caggctgtca tgggcatcaa 49020 
gtaacagggc actccacctg gaccctgaca cgtgatgcac aggaatgcca gctgctatgc 49080 
catgggtgtg gcagtagtaa taaagtgacc atctgtatcc tcaccacagt gaagcctgtc 49140 
cagggctttc tctccta^c ccccatgcct ccaggtggcc ttggatcctg ttggttctgt 49200 
gctctgctca gcgacctttc tcccgtggga gttcctgggg gttcagcttc atcctacaga 49260 
cagcagcaca cactggctgt gcaccctttt tttttttttt tttttttttt tgagatggag 49320 
tctcgctttt ttcgcgcagg ctgaagtgca gtggtgtgat cttggctcac tgcaacctct 49380 
acctcctggg ttcaagtgat tttcctgcct caccctccca agtagctggg attacaggct 49440 
cccaccacca cgcccggcta atttttgtat tttcagtaga gatggtgttt caccatgttg 49500 
gccaggatgg tcttgaactc ctgacctcag gtgatccgcc cacctcagcc tcccaaagtg 49560 
cagggattac aggcgtgagc caccacaccc ggagtgccgg ttgtttttag cagtttgtct 49620 
tgttcctgga gagactggct cctgcccagg agctcgggga gtagggccgc ggggtgctgc 49680 
ctcacacctc gagtttggcc gtaagcagag gggacatttt gtgactgtcc ccctcctgag 49740- 
cttcccagca gcttttctcc aagttacagc ccaaaagctc aggtggattt gcaacccaac 49800 
ggtgtctgtgcacctcccactgatgcccgaactgccctggccaagaaacggggccgtcag 49860 

aacgctgcac taactgcagc cttgggcctc catgccagag gccatgccct tccatccacc 49920 
accccctggc ctgggccctg ggccctcctg gctcgggaac tccaggcccc ttcctcacgg 49980 
ctcgagagac gtgtatttac cgcacaggtg cttgtcattc tcttgtggcc tcttctccag 50040 
ggagatcaca gaaggacagg gcctcactga ggtctcggac atggaccctt tgatagtggc 50100 
aggagccagg ctgggcaaga ggcggccaca gtcacctcag cagtgccatc accaccgcca 50160 
ttcagccctt ccctgagccg ggcgcgcccc tggctctggc cccagtgtcc cagttacagc 50220 
tcacaggagc ttgtggtgcc cagcggctgc ttctgattga gagtcgaggt cggaggcttt 50280 
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gggaggctga gaggctgctc ggtttcacaa ctgctgaggg agacttgggc tccatctcag 50340 
gtatgcccca tgtcgccctc aacctccagc caccggtcct ccgtgtcccc catggccagg 50400 
cacggcttgc agacatctgt cgttggctcc tctcagccgt cgtgggctga ccctggcacg 50460 
tcctcctgtg gctgagccca gtggggacag ctgcttcctt ttattaccct agaactctcg 50520 
tctttgatca ggccccctcc cctatgccac acagtccctg tcactcgggt gagcccagta 50580 
gtcatgggga aggcctgcgg gttccaaaca tccaaaggct tgcgtgcagc atgacagctt 50640 
gaaaccgatg ttttttacct tgatcagatt tcagcttggc gggggctttg ctcagctttc 50700 
agtgaggcct gggccgattt cccagcatcc cctcctgagg ccagcctctg tttcctgtga 50760 
ttttctgcac aaagtgggag ggaggagtcc taggaaatgg ggggccacct cgaagcctag 50820 
gcctcctctg gcttctctgt gccagtgccc ccacgctttg tgtctgtgtc cccagcccat 50880 
gggactctgc tattccctga gtgctgccgc atgcccagcc cgcactgagg acgtggagcc 50940 
ccgaggggca ggatggcctc catggtcaca cgtaggaagt ggcctccacc ctccgatgat 51000 
cctctccctc ctccctttca gcgccctccc cgggggtgtc ctcagccctc ctgcctgtgc 5 1060 
tttgtcccgt cttctgcagg cgcctgggac gtgctgacag gtcctctgcc ggctcctgcc 51 120 
ttgctatgcg cacgctggtc accacagagg cctggccctt cttctgtagc agtcccacac 51180 
ccgcaacagg tgtggctgct gaccacctgc tttctgcccc tctggtcctg aggagggcgc 5 1240 
agtgggcact caggcgtggc tgagcagatg tgtgttgccg ggaggaggaa ggactgctcc 51300 
agtcagggct gaatttccca cccggagcat ttctgctgta tttggtgtag cgcctgctgc 5 1360 
ttaaagctct gattcccagt tggcaccctt tcccttctgc attgaaaaac atacggatgc 5 1420 
atgtcttctt gcagtgaatg tgtattctcc cagcctctct tctgggttgg ggctggaggt 51480 
ggagcggcac acaggagccg cagcgatgga ggatgtgcgg gtgcagcacc ccgtacagca 51540 
gggatgccaa acccgcgctg agtccctctc aacttctgct ttgaagccca gtcacgccat 5 1600 
tgcctgggtt ttgctgggcg gggctgcgtg tgatgttctc ctctgtccct cccccagagc 51660 
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cgcccacctg ctccccggac cagtttgcat gtgccacagg ggagatcgac tgtatccccg 5 1720 
gggcctggcg ctgtgacggc tttcccgagt gcgatgacca gagcgacgag gagggctgcc 51780 
ccgtgtgctc cgccgcccag ttcccctgcg cgcggggtca gtgtgtggac ctgcgcctgc 51840 
gctgcgacgg cgaggcagac tgtcaggacc gctcagacga ggcggactgt gacggtgagg 51900 

- ccctccccgt caaggctctg. ccaagaccct ggccctgccc tccgggatac gagcttgggg 5 1960 

ctgcctccgg cctcacagga gtaggggctc tgaaaacctt tgcttgcagg gagattgcca 52020 
agtctgtctt ttaggcccaa caaggaaaac tctgcagttc cacccatcct gtcccaccag 52080 
. gtagtgtggcttgaaggcagactgtgagggtctatctcac cttcctgcattaggtcagga 52140 
gtttcacaga aacctgaggc acattcaggg gtgggctgca gaggtccatg gctoacaccc 52200 
tggaaaatcc gcccccaaaa gacagtgctg tctccactga ccagtctgtg ggatagtgct 52260 
taagcctgag tggtttctat caacatgtag aatcaggagg tataaagaga tttgctcagg 52320 
catcctgggc cctctctgac cagcaggatc ttcctttaga tcttgacagt gaaacacatc 52380 
tcttctgtgc cccctgtgag ttttctttca ttcattcatt cattcattca ttcattcatt 52440 
cattcattcg agacagagtc ttgctctgtc acccaggctg gagtgccctg gtgtaatctc 52500 
ggctcactgc aacctctgcc tccagggttc aatcgattct cctgcctcag cctcccgagt 52560 
agctgggatg acaggtgcgc accaccatgc ctggctaatt tttgtatttt tagtagagac 52620 
agggtttcac catgttggcc aggctggtct cgaactcctg acctcaggtg atccgcccgc 52680 
ctcagcctcc caaagtgc^ ggattacagg catgagccac cgcgcccggc ctgagttttc 52740 
cttttatgaa ggacctgctt ggttggttgc ctgccacatg ttgtcagcac catgggccca 52800 
ggactgctga ggagctgttg atgccctcgc tctcccagag ccaccggctc tgttagataa 52860 
ttcacatgca gtctggccac tgtcctacgt cctcattcac aaagagcaga catttcgtag 52920 
aagatgaggg cctgggagta acctccctgc atgtttttct ataaaggcat agtggttaag 52980 
tccttccagc tcattgacca ttggagaatt ttatggaggc tgtagactag gggctggtaa 53040 
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actaagggcc caggggccaa atccagcctg ccacctactt ttgtaaataa agttttcttg 53 100 
gtgcacagcc atgcccattc attcatttgc acaatgtctg tggctgcttt catgccaaaa 53 160 
gcaagagaac tgagtggtta tgctggagac ctacggcctt caaagcccca gacctcacgt 53220 
ctggcccttg acagacagag cttccccagc cctgctgcgc atcctggccc agcatgtgct 53280 
gtgtgtgtga tttcagcttg caggagccgt ggttaggaat tgtccctgtg ttggtccatt 53340 
ttgcattgct atgaaggagc acctgaggcc gggtagatta tgaaggaaag aggtctgtct 53400 
ggctcatggt tctgtaggca gcaccagtat ggcacccgca tctgctcagc ttctagtgag 53460 
gtctcaggaa gctttgactc atggtgaaag tcgaagcggg agcaggtgca tcacatggtg 53520 
agagagggag caacggagag agagagagag cgcctctccc tcttgccctc accttgagag 53580 
gagatgccag gctcctttaa gtaaccagct cccatgtgaa ctcacagtga gagcccattt 53640 
gctactgcgg agagggcacc aggcatctgc tcccatgacc caaacactgc ccaccaggcc 53700 
ctacctccaa ccttggggtc atattttatt ctgttctatg ctatgctatg ctatgccatg 53760 
ccatgccatg ccatgctatt cctattctat tatttgagac agaatctcgc tctgttgccc 53820 
aggctggagt gcagtggcat gatcttggct cactgcaacc tccacctccc aggttcaagc 53880 
gattctcctg tatcagcctc ccgagtagct gggattacag gcacacacca ccacacccgg 53940 
ctaatttttg tattttcaat agagatgggg tttcaccatg ttggccaggc tggtctcaaa 54000 
ctcctggcct caagtgatcc acctacctcg gcctcccaaa gtgccatgat tacagatgtg 54060 
agtcactgcg cccagtgagg gtcacatttc cgttgagatt tggaggggca gacgttggag 54120 
ccatctgagc cccctcgtcc cgctctagct tctcctcccg tgtgccccgc ggtgctggtg 54180 
gcaggccctt acgccggttc tggctgcatg ctctgttcca gaagctttct tccctgcttg 54240 
gttacxagaa aatcatccca tccattacaa ggacagggtc cccttatctc ccattcccag 54300 
ggcaggacac cgggggcagg gcaggtgggg aactgagcaa gttctctggg ggcaggcgtg 54360 
gctatggctc cctctgggtg ggcgtctggg gaggggtgga ggcagccgtc agcgccctgg 54420 
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cttgctcttc ctccctggcc agagactgtg gccttgtgct gctcccgtgt gggctgcctg 54480 
cacctccagt gggttgtgct ccctcccctc ccctcccctc aagctctgct gagcaccact 54540 
gccttccaca gcccccactc tcgggaggcg aggctcctcg tggccattcc tgtccttggc 54600 
acccaccccc ccaccaacct ggtagagcct tgggcggggt ctgttactcc ttgcatggcg 54660 

" tagacctccc cacagtaggc acctgaeaca tacctcctgg ggggcaggca ggaggtgcgt 54720 

tgaggtctca gccctggcag tccctcccct gcgtggcata ggcctcgcca cagggtcatc 54780 
gagggtgggt ggagactgta ctagaccact ccccgctggt cctagaaagg gtcccatctg 54840 
tctgctctct gtttggagtc cagaccttgg ttgctgtgcc ctgcatggtg ggctgggggg 54900 
caccctccag cctctctgag tgcatggcct ctccttgcag ccatctgcct gcccaaccag 54960 
ttccggtgtg cgagcggcca gtgtgtcctc atcaaacagc agtgcgactc cttccccgac 55020 
tgtatcgacg gctccgacga gctcatgtgt ggtgagccag cttctggcac ggggaagggg 55080 
cgtccgggct gggttccccc aggaacgtgg agtttagggg aggagacgtg cctttccagc 55140 
ggggctgggg gctgtgtggg agactcaggc ggctgggagg ctccttgcgg gaggcaggga 55200 
agcctttccc agggcagcgg ccaggaggac agactgtgag ctgtgggctc ggcggctaca 55260 
gagtctgcct cagtgggcgg ggctgatggt gtccaggtgc ctgcagcacg cacccaccca 55320 
cgggaccttg ctgagcagcg tctgtcaggc agcaagatta cccgagggct gcagtggtcc 55380 
tgttccctgg cagcttactg tctggctgag gaggagtgat gttcacatat gcacacatgt 55440 
catgtgcaca cacatgtaca tgacaacatc ccacatgctc ctcaaatagc atgacctgta 55500 
cagtcacgga tatagggcct aggggatagg aggccaagac agtcagggaa gactttccag 55560 
aggcagtggc tcctgaaagg ctgtotgatt caggcaggaa gggagctgag ttcagatagg 55620 
aagtagcaat gagtcattgt gtctggggac atggccactc cttcgctgca gagggacctg 55680 
ggctgagagc tcctctctta tggctgcagt cgggagagaa gtctgttggg gggagaaggg 55740 
ggcttcctca agggactccc tgtgcccttt ggcaccttcg tgccaggtca ggcttgaggc 55800 



"^OOl/SlSn PCT/USOl/16946 

218 

ctgaaggcag tggtgggggc caccaagggt cgcctcctct gctgggcaag ttcccagtct 55860 
gacgggcctg tgccgtgggc cccagctgtg ggggcgctgt tgatgcgcag ccaggcctcg 55920 
ccgccagagc ccgcacgctt ccattccgct gacttcatcg acgccctcag gatcgctggg 55980 
ccggccctgt gggagagtga atgtggcttt tgccaaagtt gagtctggag cctggaaact 56040 
tccctatggg cagccttgat agtggagtgg cccaaggagc ccacccagcc gaccctgccc 56100 
ctcccgtggc tggtgggcgg caccaggggc tgcctggctt tgctcgttca ccaacatcac 56160 
ctgggctggc cagggcgcgc tcacttctgc caccaccgag ggccctgggc gaaggagtga 56220 
ataccaggct gccttggcag ggatgtgttg agggctgtgg ggagtcggac agcggcgggg 56280 
gtcagaggag gaggagggtg caccgtgcag gctgaagggc cacgttaccc tgaggttggc 56340 
caggctcccc aggcctagcc tcccagctcc cccactttot ccccaccctc caccagtggc 56400 
aaagccagcc ccttcagggc gcacggtgtc tgcccccaag gagggcccat tccgttgggg 56460 
ttaatgttgg ccacctcttt ctgtttgtct ctggcagaaa tcaccaagcc gccctcagac 56520 
gacagcccgg cccacagcag tgccatcggg cccgtcattg gcatcatcct ctctctcttc 56580 
gtcatgggtg gtgtctattt tgtgtgccag cgcgtggtgt gccagcgcta tgcgggggcc 56640 
aacgggccct tcccgcacga gtatgtcagc gggaccccgc acgtgcccct caatttcata 56700 
gccccgggcg gttcccagca tggccccttc acaggtaagg agcctgagat atggaatgat 56760 
ctggaggagg caggagagta gtctgggcag ctttggggag tggagcaggg atgtgctacc 56820 
ccaggccctc ttgcacatgt ggcagacatt gctaatcgat cacagcattc agcctttccc 56880 
actgagcx;tg tgcttggcat cagaatcctt caacacagag gcctgcatgg ctgtagcaac 56940 
ccaccctttg gcactgtagg tgtggagaaa gctccttgga cttgaccttc atattctagt 57000 
aggacatgtg ctgtgttgtc cacaaatcct catgtaccct agaaatgaat gtgggggcgg 57060 
ctgggctctc tccagagctg aaggaatcac tctgtaccat acagcagctt tgtcttgagt 57 120 
gcagctggga tttgtggctg agcagttaca attcctacgt ggcccaggca ccaggaacgc 57180 



wo 01/92891 PCT/USOl/16946 

219 

aggctgtgtt tgtagatggc tgggcagccg caccgcagag ctgcaccatg ctggtttgta 57240 
tcacatgggt gaccatggta tgtctaagaa ggtggagtcc ctgtgaggtc tgcaggtgcc 57300 
cccacagctc caggccacct tgaggattgc ctctgcctgc ccagccctga gttccctctc 57360 
ccctgtcctg tcccactgtc accccaagcc ggcctcattg ggagcctgtt ggatggcagg 57420 
gtatagatgt aacctgattc tctctgggga gcggggttat ctggcttctc aagagctcct 57480 
aggagcccac agtggtggca ccatcacagt cgcagcagcc cccagagaac gcggccctgt 57540 
ctgttcctgg cgtgctctgt gctgccccgc ctgggttccc tgccccagtc gcaggcccct 57600 
tggaggaggt accatgtgtc tcccgtttca cagatgagcc ccggggagct cactctagta 57660 
gtggccagag aggcctgcgg ctcagggagc ggggcacatt tccaacagga cacaccgccc 57720 
tggtctgagt ctcgtgggta gtgggagcag aggagagcgc cctatgtctg tggggcggct 57780 
tggctgagcc tggaagccac ctgacctccc ccgtcccttc cctgccaggc atcgcatgcg 57840 
gaaagtccat gatgagctcc gtgagcctga tggggggccg gggcggggtg cccctctacg 57900 
accggaacca cgtcacaggg gcctcgtcca gcagctcgtc cagcacgaag gccacgctgt 57960 
acccgccggt gaggggcggg gccggggagg ggcggggcgg gatggggctg tgggcccctc 58020 
ccaccgtcag tgctggccac cggaggcttc ccgggttcct gggggctgtg ccaccgcctc 58080 
tgaggcatgc ttgctttctt cccttttcaa acccttctgc ttccttcttt aatgacattg 58 140 
ttgattgtgg ataatctgaa aactacacaa aaatataaag agccaaaatc tcacccaaat 58200 
ccacctccta gagtggctgt tgggctccgt cagcatccag gcggccgtct gtgttocgca 58260 
cggcccagcc catcgatagc cgcctgcacc aggcctgtct gccctctgtg agcctcccca 58320 
cagggttccc tccacaaaca ccctgttctc ccacccaggg ctggctgctt cctggaaaac 58380 
agctggatgg ttttgtgcat gacagacaaa cacagggtga ttttcgtggc taaaatactc 58440 
cctggagctt ttggcagggt gaggggctgg ctccagctga gccacgcctt gagtgaaatg 58500 
actgtgagga gaataaactg ccgctgcxct ccaggatcae tggggctggc tggggagaac 58560 
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ccccgtttct gggagcacag tcccaggatg ccaaggcgag cttggtgccg agatgtgaac 58620 
tcctgagtgt aaacagcggg ggctgacttg acatgctttg tatgcttttc atttgttcct 58680 
gcagctgtat gcccctaagg tgagtccagc ccccttctgc ttcctctggg gcctcgccag 58740 
tgagccccac cttgctgggg ctggttcctc ctgcccttct gggtatccct cacatctggg 58800 
gtcttgtctt cttgttttct ttttcttttt tttttgagac ggagtttcac ttttgttgcc 58860 
caggcttcag tgcaatggtg tgatctctag gctcaccgca acctctgcct cccaggttca 58920 
agcagttccc ctgcctcagc ctccctagta gctgggatta caggcatgtg ccaccacgcc 58980 
cagctaattt tgtattttta gtagagatgg ggtttctcca tgttggtcag gctgatcttg 59040 
aactccctac ctcaggtgat ccgcccacct tggcctccca aagtgctggg attacaggcg 59100 
tgagccaccg cacctggcct ttttcttttc ttttcttttc ttttttctga gacagggtct 59160 
cgctctgtca cccaggctgg agtgcaatgg tgtcatcatg gctaactgca gcctctacct 59220 
tctaggctca agcaatcctc ccatctcagc ccctaagtag ctaggactgc acgcatgcat 59280 
ccccatgccc agctaatatt tacatttttt gtagagatga agtttcacta tattgcccag 59340 
gctggtctcc aactcctgga ctcgagcgat cctcctgcct cggcctcccc aggtgctggg 59400 
attacaggcg tgagccaccg tgcctggcct ggggtattgt cttcttatgg cacctgactg 59460 
tggtgggccc tgggaaggaa gtagcagaag agggttcttc ttggtttcct ggacagtaac 59520 
tgagtgttct ggaggcccca gggcctggct ttgtttaggg acaaagggaa ctggtaacca 59580 
gaagccgaga gtttaaacac ccactgccct tcttccctgc tcctgctgct gcaacccagc 59640 
ttaaccagcc aggagtgcta ggaacccaag cagggccccc gagcacacag caggcagctc 59700 
acgaattctc ttttcctgtt ctcccttggg agctgggagg atcttaatca ggcaataaga 59760 
gatggcactg agcagccagc taatttttta aatcacttta ttgtttaacc atatgactca 59820 
cccacttaaa aaagggtaca gttcagtggg ttttagtgta ttcacagatg tgtgcaaccc 59880' 
tcaccacagt taattttaga acattttcct gcccctaaaa gaaactctgc atgaagccag 59940 
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ctgtttttaa attagcaaag ttattttgca tcctttaaat atatgttcat ggtacaaaat 60000 
tcaaaagata cagaagagtc tgcagtccaa agagactccg cccccatgac gccaagcagg 60060 
catccctggg aggcatggcc tcctgcagtg tgtttcttct atgtcccccc aggggtcato 60120 
tgtacatatg caagcataca agagcgtgga ctttgttttc caagccagaa gataattgta 60180 
gatttatgtg cagttgtgag aaagagcaca gacccattta tcctctgcct ggtttccccc 60240 
agtgctgcct gccatcttgc atgacttcca ttcctatcat aagcaagaca ctgataacga 60300 
ttctttcacc ttattcagat tgacataagt gttttttgtt tgttcttgag acaaacttcc 60360 
tctgtcaccc agtgggagtg cagtggcaca atcacagctc actgcagcct caaactcctg 60420 
ggctcaagcg attctcctgc ctcagtcccc tcaagtagct cagatggcag gtgtgcacca 60480 
tcatgccagg ctaattttta aattttttgt ggaggtgagg cctcactaaa tttcctgggc 60540 
tagtcttgaa ctcctgagct aaagtgatcc tcctgcctca gcctcccaaa gtggtaggat 60600 
tacaggcatg agccactgcg cctgggctga catatgtgtt ttcgtaagcc cgaaagatag 60660 
catctgaaga gtcaacattg agccttgcct tttgctgcta acgatgtata aaagctgctg 60720 
ttctgagcat ttcggaggct cccagctgcc gtgtgcaccc tgcctagagc tctaccgtaa 60780 
cccatctccg ggaggaggtg ctattgtttt cctcattttg caacaaggag gctgaagaac 60840 
tgagcatgaa ccactggcct gggtcgttcg gttggtaggc agtggggcca ggccatccaa 60900 
ctcacaacca ccttctactc tgcttccccc gcaccctgaa gtttgttctg ttttgaggac 60960 
acagccgtca cattcttggt ggctgaacag cactccttgt caggcgtggc tgggccccca 61020 
ctggagggca tcatggtcct ctctcctgct gcggttgaac cttggctgtt tcaaccactc 61080 
ctgccaagtg gccctctgaa agggacagtc catcttttct cagcagaggg ccacactggc 61 140 
aaaacggtcc ctggcaccct ttctctccac ctgtctaata tagagtaaaa atggtatcat 61200 
gttaagatct tcatttatat ttattttatc atgaatgatg taagcatcat tttgtgtgtt 61260 
. taagaacctt tgggcccagc gtgatggctt gcagctgtaa tctcagcact ttaggaggct 61320 



wo 01/92891 

222 

gagatgagcg gatcacttga ggccgggagt ttgagaccag cctggccaac atggagaaac 61380 
cccgtctcta gtaaaaattt aaaaattagc cgggtatggt gatcccagct acttgggagt 61440 
ctgaagcatg agaattgctt gaacatggga ggcggaggtt gcagtgagcc gagatcgcgc 61500 
cattgcactc cagcctgggc gacagagcga gactctgtct caaaaaaaaa aaaaaaaaag 61560 
aaaagaaaag aaattatcaa tctcctcttt tatggcatat atatatatat atatatatat 61620 
atatatatat atatatattt tttttttttg gttatgttca gaaaggcctt ccctgctctg 61680 
atcataaaaa acaacttatt ttcacactct ctctcttttt tttttgagac agagttttgc 61740 
tcctgttgcc caggctggag tgcagtggcg caatctcagc tcactgtaac ctccgcctcc 61800 
cgggttggag tgattctcct gccttacctt cccgagtagc tgggattata ggcatgcacc 61860 
accatgcctg gctaattttg tacttttagt agagacgggg gtttctccat gttggtcagg 61920 
ctggtctcga actcgcgacc tcaggtgatc cacccacctc ggcctcccaa agtgctggga 61980 
ttacagacgt gagccaccat gcccagccca cactctcttt cttaacgtcc tcctcctttc 62040 
gttttacgtt cacatcttta attcttctgg gatgtaatta gatttgatga gcaaggtggg 62100 
catccagctt gtttcttggc tgatggctta tgggtggcgt gaattagtcg gggtctatca 62160 
ggaggcagaa actctatgag aatttgaaca gagaaagttc cgtctacagg cttattacca 62220 
jgggactggaa tagcagaaat tgaacagtga gatgtacaga gaactctaag aatgcaggaa 62280 
taggccaggc atggtggctc acacctgtca tcccagcact ttgggagacc aaggcgggtg 62340 
gatcacctga ggtcaggagt tcgagaccag cctggccaac atagtgaaac cccatctcta 62400 
ctaaaaatac aaaaaaatta gctgggtgtg gtggcgcatg cctgtaatcc cagctactcg 62460 
ggaggctgag gcaggagaat cacttgaacc tgggaggcag aggttgcagt gagccgagat 62520 
catgccactg tactccagcc tgggtggaag agcggaactc tgtctgaaaa aaaaaaaaaa 62580 
aacaagaagt tcaacttgaa gggaaaaatg ccgtattgtc tttccctttg ttatgtcacc 62640 
agggcacagt ccatcccagg ctggcgctga tccacgggct ggagaggggc tgccccagaa . . 62700 
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gaggacatgc caggaagggc ttggctggtg ttcaggagcc caggccaggt caggtcaaga 62760 
ggtgttgagg ctggacggga gaggccagct aggggctcat gtaggatatg aggggtcggc 62820 
ccatttcaac gtggaaactg agctcttctg cttctctttc ttcttcactg cattaagatt 62880 
caataccgct tgggaagcag gtatttccct tcctataaag gatggttggg agcctgagtg 62940 
ttgggagaaa gtgtagccgc tgagttacta acaactaggg ctgccgtcaa gcctatgggg 63000 
aaagagagaa gaggacattt ggaaggagag agatcaagct gtggcaccct gggagaggac 63060 
cacagaaaag aggccagtga gggggttccc cggtggcatc tgaaggtgtg gcccaaccag 63120 
gaggtccaga ggctgccagc cgagtggccc aggagaggga acctcacagg ggctgagtgg 63180 
gacccaagcc ctatccaccg tcctaaccac ccacatttct cgggaacaag acctcccaca 63240 
gtggcctccccggcagtggaaatagccaaactggcaacatggactttcttcaactgcccg 63300 

ggcgatgctg cctcagtgcc ccagggcagg caggaagctc ccacacccat tctggaatga 63360 
ggggttggag gaaggctgag ctgagcaaag gacccatctc tgctctggtt ggtggggagg 63420 
gagcccatta tacaagagac ccctcagggc tcagtgaggg gtgacagaga cttggggagt 63480 
agtggctgtcactgcagaggtgagagggtttggagagaaggtacatgcctttttggccac 63540 

attgagtagc acctggtagc cagttagtaa cgtgtattgg ataaacaaaa gattaaacgg 63600 
atgcaaaaaaaaatgttggctttgcttctttttacccaaacctcagttccctcaagtaga 63660 
ttctgggaac accccctacc tggctggact gttgtgaagt ttaaataagc caggttaact 63720 
tcacctcctc ctttaagaca cagctcagac actgcctcct ccaagaagcc ccctctggct 63780 
tcctgtgtga atatgacggc cctctgggct ctagggtatc ttagaacaat gcttccttat 63840 
ggctttggaa ccccgctgtc tcctggattg ggagcaaatg caggggagga gccacacctg 63900 
actaatctct gggtctccca gcacataagt ggcataaggg cagggctgtg cccgcttcag 63960 
gcacttactg aaggatgtac ttggcagagg gtaggcagcc ggcgga^ag cccctcactc 64020 
tccccagctg actgcgtggg cgggaaaggc gggttcagga gacccagcct ccctgggctg 64080 
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tcacqacctc tgcacatcca gccccattga tcaagggttc aatttttggg gtcctgttgg 64140 
gaggccagga gactctctcc aggcacttct tccaggtctt tgtgttaggg tgtgtgtgtg 64200 
tgtgtgtgtg tgtgtgtgtg tgtgttgttt gttttatttt atttatttat ttatttattt 64260 
atttatttat ttatttattt tgagacgcag tctcgctctg ttgcccaggt tggagggtgg 64320 
tggcatgatc tcggctcact gcaagctccg cctcccgggt tcacgccatt ctcctgcctc 64380 
actcttcctg agtagccgga ttacaggcgc acgcaccatg cctggctaat tattttgttt 64440 
ttttagtaga gacagggttt cgccacgttg cccaggctgg tcttgaatcc ctggcctcaa 64500 
gcgatccgcc cgcctcagcc tcccaaagtg ctgggattac aggcgtgagc caccgtgccc 64560 
gcccagccta ggggtacatg aaactttttt LLIILLILIL ttgagacaga gtttcactct 64620 
gtcctcaggc tggagtgcag tggcgtgatc tcggcgtact gcaatctccg cctcccggtt 64680 
caagcgattc tcctgcctca gcctcccgag tagctgggat tgcaggcacg cgccaccaca 64740 
cccagctaat ttttgtattt ttagtagaga cgggctttca ccatgtggga caggatggtc 648Q0 
tcgatctcct gacctcgtga tccgcccgcc tcagcctccg aaagtgctgg gattacaggc 64860 
ctgagccacc gtgcccagcc atgatgtttt gatacaggca tataacgtat aataatcaca 64920 
tcagggtaaa tgatgtaacc atcacatcaa gcatttatcc tttgtgttac aaaaaaaaat 64980 
ctaattatac tttcctactt attctttttt tttttttttt ttgagacgga gtctccctca 65040 
gtcgcccagg ctggagtgca gtggcatgat ctcagttcac tgcaagctct gcctcctagc 65100 
tctgcctcct gggttcatgc cattctcctg tctcagcctc gcgagtagct gggactacag 65160 
gcgcctgcca ccgtgcccgg ctaatttttt tttttgtatt tttggtagag acagggtttc 65220 
accgtgttag ccaggatggt ctcgatctcc tgacctcata atccgcccgt ctcggcctcc 65280 
caaagtgctg ggattacagg catgagccac cgcccccagc ctatttattc ttaaatgtac 65340 
aataaattat tgttgactcc agtcaccctg ctgtgctacc aaatacggat cttcttcatt 65400 
ctatctaact gtatttctgt acctgttaac catctctcct ccacctcacc ccccaaaccc 65460 
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actacccttc tcagcctctg gtaaccatcc ttctactctc tatctctatg agttcaattg 65520 
tattaatttt tagctccccg gccgggcacg gtggctcacg cctgtaatcc cagcacttca 65580 
ggaggctgag gcaggtggat cacgaggtca ggagtttgag accagcctgg ccaacatggt 65640 
ggaaccccat ctctactaaa aacacaaaaa ttagctgggc gtggtggtgg gcgcttgtag 65700 
tcccagctac ttgggaggct gaggcaggag aatcgcttga aactgggagg cagaggttgc 65760 
agtgagccaa gattgcgcca ctgcactcca gtctgggtga cagagtaaga ttccatcccg 65820 
aaaaaaaaaa agtttagctc ccacaaataa gtgagaacac gtgaagtttc tctttctgtg 65880 
cctcgcttgtttcacttaacataatgacctccagttccatccacgttgttgctttgttat 65940 
aaatgacaggatcttggtcaggcgcagtggctcatgcctgtaatcccagcactttgggag 66000 
gctgaggtgg actgatcatg aggtcaagag atcgagacca tcctggctaa cacagtgaaa 66060 
ccccgtctct actaaaaata caagaaatta gccgggcgtg gtggtgggca cccatttccg 66120 
ccccttctcg ggacgctgat gcacgacata ttacccatcc ccggaagact aatcctcccc 66180 
cactctatat tgtacctctt cctttctcct ccacgcgatt ccccgagtaa cccgtcttcx 66240 
ctccctcctc ggattacgct cacctttccg cttcaatcac gttgctccgt ccccttcccc 66300 
attcgtacca ctcctcactt tcgtcttcct acccccacta tcccttttcg tcctctctat 66360 
tccttactta ctcctccccc ttctcttcat acttcattcc ctccgctctt cccactcgcg 66420 
ctcccacttt cacctagttg ccctcaccta cgttgccatc tcgccccttc ttcagctctc 66480 
ggcctctcac ccatctgtcc totctcttac ctctctcctc atctcgctca gacatctctc 66540 
tagactatcc ctcactttac cttctcagtc gtcttcttcc tatccttcgt tctccatgat 66600 
cttcacgtcg ccatctcttt tcgccccttt catatgtctc tcttcatgtt ctcactatca 66660 
ttctcatgat cactatcgtt ctcactactt atcactcccc tctttcttca tcaattcctc 66720 
tccgtcattc tcgtctctct cttacaaccg ccttccttgt gctatctaac tcaaccatgc 66780 
ctctcctact ctctctctat cgcccctcca tcgcttatgc atcctcttct attgcacacc 66840 
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cgcccctcca tcgcttatgc atcctcttct attgcacacc gcccctccat cgcttatgca 66900 
tcctcttcta ttgcacatcc tcttctattg cac 66933 

<210> 12 

<211>21 

<212> DNA 

<213> Artificial Sequence 

<220> 

<223 > Artificial sequence is a primer. 
<400> 12 

ctgagcggaa ttcgtgagac c 21 

<210> 13 

<211> 23 

<212> DNA 

<213> Artificial Sequence 

<220> 

<223 > Artificial sequence is a primer. 
<400> 13 

ttggtctcac gtattccgct cga 23 

<210> 14 

<211> 20 

<212> DNA 

< 213 > Artificial Sequence 
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<220> 

<223 > Artificial sequence is a primer. 

<400> 14 
ctcgagaatt ctggatcctc 

<210> 15 

<211> 22 

<212> DNA 

<213 > Artificial Sequence 

<220> . 

< 223 > Artificial sequence is a primer. 

<400> 15 

ttgaggatcc agaattctcg ag 

<210> 16 
.<211> 21 
<212> DNA 
<213> Artificial Sequence 

<220> 

< 223 > Artificial sequence is a primer. 
<400> 16 

tgtatgcgaa ttcgctgcgc g 

<210> 17 
<211> 23 
<212> DNA 
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< 213 > Artificial Sequence 
<220> 

< 223 > Artificial sequence is a primer. 
<400> 17 

ttcgcgcagc gaattcgcat aca 23 

<210> 18 

<211> 21 

<212> DNA 

<213> Artificial Sequence 

<220> 

< 223 > Artificial sequence is a primer. 
<400> 18 

gtccactgaa ttctcagtga g 21 

<210> 19 

<211> 23 

<212> DNA 

<213> Artificial Sequence 

<220> 

<223 > Artificial sequence is a primer. 
<400> 19 

ttgtcactga gaattcagtg gac 23 

<210> 20 
<211> 21 



21 
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<212> DNA 

< 213 > Artificial Sequence 
<220> 

<223 > Artificial sequence is a primer. 

<400> 20 
gaatccgaat tcctggtcag c 

<210> 21 
<211> 23 
<212> DNA 

< 213 > Artificial Sequence 
<220> 

<223 > Artificial sequence is a primer. 
<400 > 21 

ttgctgacca ggaattcgga ttc 

<210> 22 

<211> 33 

<212> DNA 

<213 > Artificial Sequence 

<220> 

< 223 > Artificial sequence is a primer. 



23 



<400 > 22 

cuacuacuac uactgagcgg aattcgtgag acc 33 



<210> 23 
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<211> 32 

<212> DNA 

<213> Artificial Sequence 

<220> 

< 223 > Artificial sequence is a prinnier. 
<400 > 23 

cuacuacuac uactcgagaa ttctggatcc tc 32 

<210> 24 
<211> 33 
<212> DNA 

< 213 > Artificial Sequence 
<220> 

< 223 > Artificial sequence is a primer. 
<400> 24 

cuacuacuac uatgtatgcg aattcgctgc gcg 33 

<210> 25 
<211> 33 
<212> DNA 

< 213 > Artificial Sequence 
<220> 

< 223 > Artificial sequence is a primer. 
<400 > 25 

cuacuacuac uagtccactg aattctcagt gag 33 
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<210> 26 

<211> 33 

<212> DNA 

<213> Artificial Sequence 

<220> - 
<223 > Artificial sequence is a primer. 

<400> 26 

cuacuacuac uagaatccga attcctggtc age 

<210> 27 

<211> 45 

<212> DNA 

<213> Artificial Sequence 

<220> 

<223 > Artificial sequence is a primer. 

<400 > 27 ^^^^^^ 
aactggaaga attcgcggcc gcaggaattt tttttttttt ttttt 

<210> 28 

<211> 13 

<212> DNA 

<2I3> Artificial Sequence 

<220> 

<223 > Artificial sequence is a primer. 
<400 > 28 
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aattcggcac gag 23 

<210> 29 

<211> 9 

<212> DNA 

<213> Artificial Sequence 

<220> 

<223 > Artificial sequence is a primer. 

- <400> 29 
cfcgtgccg 9 

<210> 30 

<211> 14 

<212> DNA 

< 213 > Artificial Sequence 

<220> 

<223> Artificial sequence is a primer. 
<400> 30 

gtacgacggc cagt 24 

<210> 31 

<211> 16 

<212> DNA 

< 213 > Artificial Sequence 

<220> 
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<223 > Artificial sequence is a primer. 

<400> 31 
aacagctatg accatg 

<210> 32 

<211> 18 

<212> DNA 

<213> Artificial Sequence 

<220> 

<223 > Artificial sequence is a primer. 

<400 > 32 
ccaagttctg agaagtcc 

<210> 33 

<2il> 20 

<212> DNA 

<213 > Artificial Sequence 

<220> 

< 223 > Artificial sequence is a primer. 

<400 > 33 
aatacctgaa accatacctg 

<210> 34 
<211> 57 
<212> DNA 

< 213 > Artificial Sequence 
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<220> 

< 223 > Artificial sequence is a primer. 
<400> 34 

agctgctcgt agctgtctct ccctggatca cgggtacatg tactggacag actgggt 57 

<210> 35 

<211> 56 

<212> DNA 

<213> Artificial Sequence 

<220> 

<223 > Artificial sequence is a primer. 
<400 > 35 

tgagacgccc ggattgagcg ggcagggata gcttattccc tgtgccgcat tacggc 56 

<210> 36 

<211> 27 

<212> DNA 

< 213 > Artificial Sequence 

<220> 

<223 > Artificial sequence is a primer. 
<400 > 36 

agctgctcgt agctgtctct ccctgga 27 

<210> 37 
<211> 27 
<212> DNA 
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<213> Artificial Sequence 
<220> 

<223 > Artificial sequence is a primer. 
<400 > 37 

gccgtaatgc ggcacaggga ataagct 

<210> 38 

<211> 20 

<212> DNA 

< 213 > Artificial Sequence 

<220> 

<223 > Artificial sequence is a primer. 

<400 > 38 
gagaggctat atccctgggc 

<210> 39 

<211> 20 

<212> DNA 

<213> Artificial Sequence 

<220> 

<223> Artificial sequence is a primer. 

<400 > 39 
acagcacgtg tttaaagggg 

<210> 40 
<211> 163 
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<212> DNA 
<213> Homo sapiens 
<400 > 40 

actaaagcgc cgccgccgcg ccatggagcc cgagtgagct cggcgcgggc ccgtccggcc 60 
gccggacaac atggaggcag ctccgcccgg gccgccgtgg ccgctgctgc tgctgctgct 120 • 
gctgctgctg gcgctgtgcg gctgcccggc ccccgccgcg gcc 163 



<210> 41 
<211> 419 
<212> DNA 

< 213 > Homo sapiens 
<400 > 41 

gccccacagc ctcgccgctc ctgctatttg ccaaccgccg ggacgtacgg ctggtggacg 60 

ccggcggagt caagctggag tccaccatcg tggtcagcgg cctggaggat gcggccgcag 120 

tggacttcca gttttccaag ggagccgtgt actggacaga cgtgagcgag gaggccatca 180 

agcagaccta cctgaaccag acgggggccg ccgtgcagaa cgtggtcatc tccggcctgg 240 

tctctcccga cggcctcgcc tgcgactggg tgggcaagaa gctgtactgg acggactcag 300 

agaccaaccg catcgaggtg gccaacctca atggcacatc ccggaaggtg ctcttctggc 360 

aggaccttga ccagccgagg gccatcgcct tggaccccgc tcacgggtaa accctgctg 419 



<210> 42 
<211> 221 
<212> DNA 
<213> Homo sapiens 
<400> 42 

ccccgtcaca ggtacatgta ctggacagac tggggtgaga cgccccggat tgagcgggca 60 
gggatggatg gcagcacxcg gaagatcatt gtggactcgg acatttactg gcccaatgga 120 
ctgaccatcg acctggagga gcagaagctc tactgggctg acgccaagct cagcttcatc 180 
caccgtgcca acctggacgg ctcgttccgg taggtaccca c 221 



<210> 43 



wo 01/92891 

237 



<211> 221 
<212> DNA 
<213> Homo s^iens 
<400 > 43 

tccctgactg caggcagaag gtggtggagg gcagcctgac gcaccccttc gccctgacgc 
tctccgggga cactctgtac tggacagact ggcagacccg ctccatccat gcctgcaaca 
agcgcactgg ggggaagagg aaggagatcc tgagtgccct atactcaccc atggac^tcc 
aggtgctgag ccaggagcgg cagccttttt gtgagtgccg g 221 
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60 
120 
180 



<210> 44 



<2ll> 156 



<212> DNA 

< 213 > Homo sapiens 

<400> 44 

tttctcagtc cacactcgct gtgaggagga caatggcggc tggtcccacc tgtgcctgct 
gtccccaagc gagccttttt acacatgcgc ctgccccacg ggtgtgcaga tgcaggacaa 
cggcaggacg tgtaaggcag gtgaggcggt gggacg 15^ 



<210> 45 
<211> 416 
<212> DNA 
<213> Homo sapiens 

<400 > 45 

ctccacagga gccgaggagg tgctgctgct ggcccggcgg acggacctac ggaggatctc 50 
gctggacacg ccggacttca ccgacatcgt gctgcaggtg gacgacatcc ggcacgccat 120 
tgccatcgac tacgacccgc tagagggcta tgtctactgg acagatgacg aggtgcgggc 180 
catccgcagg gcgtacctgg acgggtctgg ggcgcagacg ctggtcaaca ccgagatcaa 240 
cgaccccgat ggcatcgcgg tcgactgggt ggcccgaaac ctctactgga ccgacacggg 300 
cacggaccgc atcgaggtga cgcgcctcaa cggcacctcc cgcaagatcc tggtgtcgga 360 
ggacctggac gagccccgag ccatcgcact gcaccccgtg atggggtaag acgggc 416 
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<210> 46 
<211> 198 
<212> DNA 
<213> Homo sapiens 
<400 > 46 

ttcttctcca gcctcatgta ctggacagac tggggagaga accctaaaat cgagtgtgcc 60 
aacttggatg ggcaggagcg gcgtgtgctg gtcaatgcct ccctcgggtg gcccaacggc 120 
ctggccctgg acctgcagga ggggaagctc tactggggag acgccaagac agacaagatc 180 
gaggtgaggc tcctgtgg 198 



<210> 47 
<211> 244 
<212> DNA 
<213> Homo sapiens 
<400> 47 

ccgtcctgca ggtgatcaat gttgatggga cgaagaggcg gaccctcctg gaggacaagc 60 
tcccgcacat tttcgggttc acgctgctgg gggacttcat ctactggact gactggcagc 120 
gccgcagcat cgagcgggtg cacaaggtca aggccagccg ggacgtcatc attgaccagc 180 
tgcccgacct gatggggctc aaagctgtga atgtggccaa ggtcgtcggt gagtccgggg 240 
ggtc 244 



<210> 48 
<211> 313 
<212> DNA 
< 213 > Homo sapiens 
<400 > 48 

gttcgcttcc aggaaccaac ccgtgtgcgg acaggaacgg ggggtgcagc cacctgtgct 60 
tctgcacacc ccacgcaacc cggtgtggct gccccatcgg cctggagctg ctgagtgaca 120 
tgaagacctg catcgtgcct gaggcctttt tggtcttcac cagcagagcc gccatccaca 180 
ggatctccct cgagaccaat aacaacgacg tggccatccc gctcacgggc gtcaaggagg 240 
cctcagccct ggactttgat gtgtccaaca accacatcta ctggacagac gtcagcctga 300 
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aggtagcgtg ggc 313 

<210> 49 
<211> 255 
<212> DNA 
<213> Homo sapiens 
<400 > 49 

cctgctgcca gaccatcagc cgcgccttca tgaacgggag ctcggtggag cacgtggtgg 60 
agtttggcct tgactacccc gagggcatgg ccgttgactg gatgggcaag aacctctact 120 
gggccgacac tgggaccaac agaatcgaag tggcgcggct ggacgggcag ttccggcaag 1 80 
tcctcgtgtg gagggacttg gacaacccga ggtcgctggc cctggatccc accaaggggt 240 
aagtgtttgc ctgtc 255 

<210> 50 
<211> 210 
<212> DNA 
<213> Homo sapiens 
<400> 50 

gtgccttcca gctacatcta ctggaccgag tggggcggca agccgaggat cgtgcgggcc 60 
ttcatggacg ggaccaactg catgacgctg gtggacaagg tgggccgggc caacgacctc 120 
accattgact acgctgacca gcgcctctac tggaccgacc tggacaccaa catgatcgag 180 

tcgtccaaca tgctgggtga gggccgggct 210 

<210> 51 
<211> 352 
<212> DNA 
<213> Homo sapiens 
<400 > 51 

gtgttcatgc aggtcaggag cgggtcgtga ttgccgacga tctcccgcac ccgttcggtc 60 
tgacgcagta cagcgattat atctactgga cagactggaa tctgcacagc attgagcggg 120 
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ccgacaagac tagcggccgg aaccgcaccc tcatccaggg ccacctggac ttcgtgatgg 180 
acatcctggt gttccactcc tcccgccagg atggcctcaa tgactgtatg cacaacaacg 240 
ggcagtgtgg gcagctgtgc cttgccatcc ccggcggcca ccgctgcggc tgcgcctcac 300 
actacaccct ggaccccagc agccgcaact gcagccgtaa gtgcctcatg gt 352 



<210> 52 
<211> 225 
<212> DNA 
<213 > Homo sapiens 
<400 > 52 

gcctcctcta cgcccaccac cttcttgctg ttcagccaga aatctgccat cagtcggatg 60 
atcccggacg accagcacag cccggatctc atcctgcccc tgcatggact gaggaacgtc 120 
aaagccatcg actatgaccc actggacaag ttcatctact gggtggatgg gcgccagaac 180 
atcaagcgag ccaaggacga cgggacccag gcaggtgccc tgtgg 225 

<210> 53 
<211> 235 
<212> DNA 

< 213 > Homo sapiens 
<400 > 53 

ctttgtctta cagccctttg ttttgaccto tc^agccaa ggccaaaacc cagacaggca 60 

gccccacgac ctcagcatcg acatctacag ccggacactg ttctggacgt gcgaggccac 120 
caataccatc aacgtccaca ggctgagcgg ggaagccatg ggggtggtgc tgcgtgggga 180 
ccgcgacaag cccagggcca tcgtcgtcaa cgcggagcga gggtaggagg ccaac 235 

<210> 54 
<211> 218 
<212> DNA 

< 213 > Homo salens 



<400 > 54 
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ccaccctccc gcaggtacct gtacttcacc aacatgcagg accgggcagc caagatcgaa 
cgcgcagccc tggacggcac cgagcgcgag gtcctcttca ccaccggcct catccgccct 
gtggccctgg tggtggacaa cacactgggc aagctgttct gggtggacgc ggacctgaag 
cgcattgaga gctgtgacct gtcaggtacg cgccccgg 218 



<210> 55 



<211> 234 



<212> DNA 



<213 > Homo sapiens 



<400> 55 

ggctgcttgc aggggccaac cgcctgaccc tggaggacgc caacatcgtg cagcctctgg 60 
gcctgaccat ccttggcaag catctctact ggatcgaccg ccagcagcag atgatcgagc 120 
gtgtggagaa gaccaccggg gacaagcgga ctcgcatcca gggccgtgtc gcccacctca 180 
ctggcatcca tgcagtggag gaagtcagcc tggaggagtt ctgtacgtgg gggc 234 



<210> 56 



<211> 157 



<212> DNA 



<213> Homo sapiens 



<400 > 56 

ttgtctttgc agcagcccac ccatgtgccc gtgacaatgg tggctgctcc cacatctgta 60 
ttgccaaggg tgatgggaca ccacggtgct catgcccagt ccacctcgtg ctcctgcaga 120 
acctgctgac ctgtggaggt aggtgtgacc taggtgc 157 



<210> 57 
<211> 272 



<212> DNA 
<213> Homo sapiens. 
<400> 57 

gttctcctct gtccctcccc cagagccgcc cacctgctcc ccggaccagt ttgcatgtgc 60 
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cacaggggag atcgactgta tccccggggc ctggcgctgt gacggctttc ccgagtgcga 120 
tgaccagagc gacgaggagg gctgccccgt gtgctccgcc gcccagttcc cctgcgcgcg . 180 
gggtcagtgt gtggacctgc gcctgcgctg cgacggcgag gcagactgtc aggaccgctc 240 
agacgaggtg gactgtgacg gtgaggccct cc 272 



<210> 58 
<211> 134 
<212> DNA 
< 213 > Homo sapiens 
<400> 58 

tctccttgca gccatctgcc tgcccaacca gttccggtgt gcgagcggcc agtgtgtcct 60 
catcaaacag cagtgcgact ccttccccga ctgtatcgac ggctccgacg agctcatgtg 120 
tggtgagcca gctt 134 



<210> 59 
<211> 274 
<212> DNA 
<213> Homo sapiens 

<400> 59 

gtttgtctct ggcagaaatc accaagccgc cctcagacga cagcccggcc cacagcagtg 60 
ccatcgggcc cgtcattggc atcatcctct ctctcttcgt catgggtggt gtctattttg 120 
tgtgccagcg cgtggtgtgc cagcgctatg cgggggccaa cgggcccttc ccgcacgagt 180 
atgtcagcgg gaccccgcac gtgcccctca atttcatagc cccgggcggt tcccagcatg 240 
gccccttcac aggtaaggag cctgagatat ggaa 274 



<210> 60 
<211> 164 
<212> DNA 
<213> Homo sapiens 



<400> 60 
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cttccctgcc aggcatcgca tgcggaaagt ccatgatgag ctccgtgagc ctgatggggg 60 
gccggggcgg ggtgcccctc tacgaccgga accacgtcac aggggcctcg tccagcagct 120 
cgtccagcac gaaggccacg ctgtacccgc cggtgagggg cggg 164 



<210> 61 

<211> 130 

<212> DNA 

< 213 > Homo sapiens 

<400 > 61 

ttggctctcc tcagatcctg aacccgccgc cctccccggc cacggacccc tccctgtaca 60 
acatggacat gttctactct tcaaacattc cggccactgc gagaccgtac aggtaggaca 120 
tcccctgcag 

<210> 62 
<211> 496 
<212> DNA 
<213> Homo sapiens 
<400 > 62 

tcaaacattc cggccactgc gagaccgtac aggccctaca tcattcgagg aatggcgccc 60 
ccgacgacgc cctgcagcac cgacgtgtgt gacagcgact acagcgccag ccgctggaag 120 
gccagcaagt actacctgga tttgaactcg gactcagacc cctatccacc cccacccacg 180 
ccccacagcc agtacctgtc ggcggaggac agctgcccgc cctcgcccgc caccgagagg 240 
agctacttcc atctcttccc gccccctccg tccccctgca cggactcatc ctgacctcgg 300 
ccgggccact ctggcttctc tgtgcccctg taaatagttt taaatatgaa caaagaaaaa 360 
aatatatttt atgatttaaa aaataaatat aattgggatt ttaaaaacat gagaaatgtg 420 
aactgtgatg gggtgggcag ggctgggaga actttgtaca gtggagaaat atttataaac 480 
ttaattttgt aaaaca ^"6 
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